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1  Introduction 


Many  of  the  envisioned  future  Army  missions  will  involve  high  performance  feedback  control 
of  uncertain  nonlinear  systems.  For  example,  control  and  stabilization  of  helicopter  based 
weapons  platforms,  precision  gun  stabilization  and  pointing,  and  the  use  of  lightweight  robots 
for  munitions  loading  all  pose  difficult  nonlinear  control  problems.  In  general,  nonlinearities 
in  mechanical  systems  arise  from  a  number  of  sources,  such  as,  actuator  saturation,  friction  in 
bearings  or  gear  trains,  backlash  due  to  gear  spacing  or  slippage,  kinematic  transformations, 
and  aerodynamic  forces.  Additionally,  many  of  these  systems  are  also  flexible,  with  nonlinear 
damping  mechanisms.  In  many  cases  the  type  of  nonlinearity  as  well  as  their  parameter 
values  are  not  very  well  known  and  prior  knowledge  is  too  coarse  to  guarantee  acceptable 
closed-loop  performance.  Under  these  conditions,  it  is  necessary  to  adapt  the  controller  by 
estimating  the  uncertain  model  parameters  from  on-line  data. 


1.1  Research  Objectives 

The  objective  of  this  work  is  the  development  of  adaptive  control  methods  which  can  signif¬ 
icantly  improve  closed-loop  performance  for  a  broad  class  of  nonlinear  flexible  systems. 

Towards  this  end,  the  goals  were  as  follows: 

1.  Form  a  parametric  model  representing  a  broad  class  of  nonlinear  flexible  systems. 

2.  Design  a  nonlinear  controller  based  on  the  parametric  model  which  provides  desired 
closed-loop  performance  if  the  parameters  were  known. 

3.  Identify  the  parameters  of  the  system  using  measured  data  and  use  these  estimates  in 
the  controller. 

4.  Analyze  the  stability  and  performance  properties  of  the  complete  adaptive  system  using 
the  method  of  averaging. 

We  concentrated  most  of  our  effort  on  items  1  and  2  as  these  turned  out  to  be  the  most 
difficult  and  challenging  tasks.  These  must  first  be  resolved  in  order  to  proceed  to  adaptation 
(items  3  and  4). 

In  general,  the  final  adaptive  system  described  above  fits  into  the  generic  adaptive  scheme 
depicted  in  Figure  1.1. 

In  this  traditional  adaptive  control  system,  often  referred  to  as  the  self-tuning-regulator 
(STR),  [2],  the  identified  model  is  usually  selected  out  of  a  model  set  with  unknown  pa¬ 
rameters.  The  controller  is  designed  as  if  the  parameter  estimates  were  in  fact  the  correct 
parameters  for  describing  the  plant.  In  the  ideal  case  it  is  assumed  that  there  exist  parame¬ 
ters,  which  if  known,  would  precisely  account  for  the  measured  data.  Even  in  this  ideal  case, 
the  transient  errors  between  the  identified  model  and  the  true  system  can  be  so  large  as  to 
completely  disrupt  performance.  In  the  usual  (non-ideal)  case  -  the  true  system  is  not  in 
the  model  set  -  both  unacceptable  transient  or  asymptotic  behavior  can  occur  e.g.,  [1], 
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Estimated  Parameters 


Figure  1.1:  Adaptive  control  system  with  parameter  estimator. 

The  above  cited  problems  are  well  documented  in  the  case  of  adaptive  linear  control, 
t.e.,  when  the  adaptive  parameters  are  held  fixed  the  closed- loop  system  is  linear.  However, 
the  case  considered  here  is  adaptive  nonlinear  control,  t.e.,  when  the  adaptive  parameters 
are  held  fixed  the  closed-loop  system  is  nonlinear.  Naturally  the  same  problems  will  arise 
in  this  case  as  in  the  adaptive  linear  case.  Unfortunately  many  of  the  controller  design  and 
parameter  estimator  issues  have  not  been  developed  sufficiently  as  yet.  In  consequence,  the 
first  step  has  been  to  resolve  some  of  these  issues. 

1.2  Publications 

The  following  is  a  list  of  publications  during  the  reporting  period.  Items  2,3  and  4  are 
included  as  Appendices. 

1.  “On  sensitivity  of  adaptive  feedback  linearization:  a  case  study,”  Robert  L.  Kosut  and 
M.  Guntekin  Kabuli,  Proceedings  of  the  7th  Yale  Workshop  on  Adaptive  and  Learning 
Systems, New  Haven,  Connecticut,  May  20-22,  1992. 

2.  “Adaptive  feedback  linearization:  implementability  and  robustness,”  M.  Guntekin 
Kabuli  and  Robert  L.  Kosut,  Proceedings  of  the  Slst  IEEE  Conference  on  Decision 
and  Control,  pp.  251-256,  Tucson,  Arizona,  December  1992. 

3.  “On  feedback  linearizable  plants,”  M.  Guntekin  Kabuli  and  Robert  L.  Kosut,  Pro¬ 
ceedings  of  the  American  Control  Conference,  pp.  1186-1190,  San  Francisco,  June 
1993. 

4.  “Real-time  implementation  issues  in  nonlinear  model  inversion,”  M.  Guntekin  Kab¬ 
uli,  Sudarshan  P.  Bhat  and  Robert  L.  Kosut,  Proceedings  of  the  American  Control 
Conference,  pp.  547-551,  San  Francisco,  June  1993. 


1.3  Personnel 


Dr.  Robert  Kosut  and  Dr.  M.  Guntekin  Kabuli  worked  on  the  project  during  the  reporting 
period  1  February  1991  through  30  April  1994. 

1.4  Interactions 

We  have  had  several  interactions  with  Norm  Coleman’s  group  at  ARDEC.  As  a  proof-of- 
concept,  the  proposed  model-follower  based  feedforward- feedback  control  design  was  applied 
to  the  analytical  model  of  the  Advanced  Weapons  Testbed  at  ARDEC,  which  is  a  central 
topic  of  the  ARO  Workshops  on  Real-Time  Control.  The  findings  were  presented  at  the  2nd 
ARO  Workshop  on  Real-Time  Control  at  ARDEC,  17-18  June  1992. 

Some  of  the  work  reported  here  was  jointly  supported  by  the  AFOSR  Directorate  of 
Aerospace  Sciences,  under  contract  F49620-90-C-0064  . 
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Technical  Discussion 


2.1  Overview 

With  few  exceptions,  practically  all  of  recent  research  in  adaptive  control  is  for  the  case  of 
parameter  adaptive  linear  systems,  i.e.,  when  the  adaptive  parameters  are  held  fixed,  the 
resulting  system  is  linear-time-invariant  (LTI),  e.g.,  [1,  2].  In  our  current  effort,  we  have 
examined  parameter  adaptive  nonlinear  systems,  that  is,  when  the  adaptive  parameters  are 
held  fixed,  the  resulting  closed-loop  system  is  nonlinear.  Although  this  is  an  active  field  of 
research,  (see,  e.g.  [10,  14])  our  efforts  to  date  have  convinced  us  that  parameter  adaptation 
cannot  be  introduced  unless  there  is  an  adequate  nonlinear  control  design.  Specifically,  a 
necessary  first  step  to  realizing  a  practical  adaptive  nonlinear  control  is  to  design  a  nonlinear 
control  system  which  is  robust  to  parameter  variations.  Because  of  this,  in  our  most  recent 
efforts  we  have  concentrated  exclusively  on  the  design  of  a  (non-adaptive)  nonlinear  robust 
control  system. 


I - J  l_ - - - ........ - .... _ - _ J 


Feedforward  Feedback 

System  System 

Figure  2.1:  Model-follower  nonlinear  feedback  system. 

Our  current  research  in  this  direction  has  led  us  to  the  configuration  shown  in  Figure  2.1, 
which  we  refer  to  as  a  model-following  feedback  system.  We  have  discovered  that  this  scheme 
is  very  well  suited  for  both  tracking  and  regulation  of  nonlinear  systems  [7] .  In  addition,  it  is 
easily  extensible  to  parameter  adaptation  as  explained  below.  The  structure  in  Figure  2.1  is 
actually  quite  generic,  e.g.,  the  major  elements  appear  in  most  flight  control  systems.  Such 
a  structure  has  been  successfully  used  in  nonlinear  models  involving  jet  engine/propulsion 
control  systems  [15]  .  Many  simulation  studies  as  well  as  preliminary  theoretical  results  show 
that  the  configuration  has  some  very  nice  properties.  Specifically  : 

•  The  controller  design  can  be  conveniently  separated  into  two  design  steps:  (1)  design 
a  feedforward  signal  generator  H,  and  (2)  design  a  feedback  regulator  C.  These  steps 
are  not  entirely  uncoupled  and  the  issues  involved  are  discussed  in  Section  2.4  . 

•  The  feedforward  signal,  u,  is  designed  using  an  idealization  or  model  of  the  plant. 
Hence,  the  feedforward  system  signal  y,  is  the  ideal  output.  The  innovative  idea  for 
the  feedforward  design,  is  to  use  feedback  linearization  techniques  [5,  10]  to  design  the 
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map  from  the  reference  command  r  to  y  ,  the  ideal  output.1  Hence,  the  feedforward 
system  globally  and  directly  accounts  for  the  plant  nonlinearities.  No  linearization  and 
subsequent  elaborate  schemes  for  “scheduling”  linear  models  is  required.  There  are  a 
number  of  important  unresolved  issues.  For  example,  input  to  output  feedback  lin¬ 
earization  based  design  is  limited  to  minimum  phase  plants.  In  addition,  a  feedforward 
design  based  on  an  ideal  model  may  result  in  too  demanding  a  control.  Hence,  by 
accounting  for  plant  perturbations,  the  feedforward  signal  can  be  made  more  cautious. 
Suggestions  for  resolving  these  issues  are  proposed  herein. 

•  The  feedback  regulator  C  is  designed  to  correct  for  (hopefully)  small  deviations  between 
the  actual  sensed  output  y  and  the  ideal  output  y.  These  deviations  are  caused  by 
disturbances  as  well  as  modeling  errors  between  the  actual  nonlinear  plant  V  and  the 
model  used  to  design  7 i  and  C.  Currently,  we  have  investigated  a  number  of  methods 
for  designing  C.  In  particular: 


-  Linearization  about  equilibria  results  in  a  collection  of  LTI  designs  for  C  for  which 
standard  gain-scheduling  approaches  can  be  used  to  link  the  designs.  In  numerous 
simulation  studies  we  have  noted  excellent  robust  behavior  even  for  a  single  LTI 
regulator  design,  e.g.,  see  [8]  which  is  included  in  Appendix  A  . 

-  We  have  examined  linearization  about  the  model-follower  non-equilibrium  state, 
resulting  in  linear-time-varying  (LTV)  control  design.  A  specific  idea  is  put  forth 
in  (see  Section  2.4.1)  . 

-  The  regulator  can  also  be  designed  as  a  nonlinear  controller  whose  local  prop¬ 
erties  may  be  significantly  more  robust  than  either  the  LTI  or  LTV  designs  (see 
Section  2.3.3)  . 

•  Adaptation  can  take  place  in  either  H  and/or  C.  Adaptation  in  H  only  affects  the 
tracking  performance,  whereas  adaptation  in  C  affects  principally  regulation,  but  also 
affects  tracking. 


In  summary,  the  key  feature  of  the  proposed  scheme  in  Figure  2.1  is  that  the  controller 
H,C  is  implementable,  i.e.,  it  depends  only  on  the  available  signals  r,y.  This  is  unlike 
designs  based  solely  on  feedback  linearization  methods  which  rely  on  state  availability,  and 
hence,  are  not  implementable.  The  innovation  here  uses  the  power  of  feedback  linearization 
to  generate  feedforward  signals  which  render  harmless  the  dominant  negative  effects  of  the 
plant  nonlinearity.  We  have  also  studied  parameter  adaptation  and  some  of  the  results  are 
contained  in  Appendix  A  .  In  this  case,  if  the  underlying  nonlinear  control  scheme  is  not 
robust,  then  what  happens  is  precisely  the  same  as  in  the  case  of  parameter  adaptive  linear 
systems.  The  instantaneous  plant  and  estimated  model  mismatch  can  induce  an  unacceptable 
transient,  even  if  there  is  asymptotic  convergence,  e.g.,  [1,2]  . 


lA  brief  review  of  feedback  linearisation  is  provided  in  Section  2.2 
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2.2  Review  of  Feedback  Linearization 


A  major  element  in  our  effort  is  the  use  of  feedback  linearization  methods  for  designing  H  , 
the  feedforward  system  in  Figure  2.1  .  To  appreciate  the  issues,  and  why  we  chose  this 
configuration,  we  will  first  provide  a  brief  review  of  feedback  linearization. 

The  underlying  theory  shows  that  a  large  class  of  nonlinear  control  systems  can  be  made 
to  have  linear  input-output  behaviour  by  virtue  of  nonlinear  state  feedback.  In  the  course 
of  tracking  controller  design  using  feedback  linearization,  the  terms  zero-dynamics  and  rel¬ 
ative  degree  (see  e.g.,  [5]  and  references  therein)  are  used  to  describe  the  limitations  of  the 
approach.  Loosely  speaking,  the  relative  degree  is  the  number  of  differentiations  one  has  to 
perform  at  the  output  to  reach  the  control  input.  The  zero-dynamics  is  the  dynamics  that 
is  rendered  unobservable  when  feedback  linearization  is  performed  to  obtain  an  LTI  input- 
output  description  whose  transfer  function  has  the  associated  relative  degree.  A  nonlinear 
input-output  map  that  exhibits  stable  zero  dynamics  is  also  referred  to  as  minimum-phase  . 
As  an  illustration,  consider  the  disturbance-free  plant  system: 

x  =  Ax  +  f(x)  +  Bu  x  €  nr 
y  =  C x  IRm  ,  m  <  n  . 

Suppose  that  CB  is  invertible,  and  for  simplicity  let 

CB  =  I  . 


Then, 


y  =  C(Ax  +  /(x))  4-  u  . 


Since  u  affects  all  elements  of  y  after  one  differentiation,  the  system  is  said  to  have  relative 
degree  one  .  Moreover,  applying  the  nonlinear  state  feedback 


u  =  v  —  C(Ax  -f  /(x)) 


renders  linear  the  input-output  system  from  v  to  y  .  That  is, 


y  =  v 


Choosing 


v  =  A (r  -  y) 


gives 

y  =  A(r-y)  . 

Hence,  if  A  is  strictly  Hurwitz,  then  r  to  y  is  a  stable  LTI  system. 

Note  that,  in  the  above  process,  n  —  m  states  are  rendered  unobservable.  Hence,  even  if 
y  is  bounded,  the  remaining  states  of  the  closed-loop  system  (the  zero  dynamics)  need  not 
be.  Thus,  feedback  linearization  is  a  plant  inversion  based  control  design  approach,  and  is 
stabilizing  provided  that  the  associated  zero  dynamics  are  stable. 
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The  feedback  linearization  procedure  can  be  applied  to  more  general  disturbance-free 
systems  of  the  form 

x  =  /(x)  +  G(x)u  x  €  IRn 

y  =  h(x)  y,u  €  IRm 

Under  certain  conditions  involving  /  and  G  ( controllability  and  involutivity  [5]) ,  there  exists 
an  invertible  coordinate  transformation 


(  =  $(x)  ,  x  =  $_1(0 

such  that 

(  =  A(  +  B(aU)+m») 
y  = 

and  0(£)  is  invertible.  Then, 

u  =  ^nv-a(0] 

renders  linear  v  to  £  .  That  is 

(  =  A(  +  Bv 
y  =  M*"l(0)  • 


Since  the  system  from  v  to  (  is  now  LTI,  standard  LTI  control  designs  for  v  are  easily 
accomplished.  In  addition  if  $  can  be  chosen  such  that  h($-1(0)  —  C(  ,  for  some  matrix 
C  ,  then  the  map  from  v  to  y  is  also  LTI.  Note  that  for  this  case,  the  LTI  system  (A,  B ,  C ) 
need  not  be  minimum-phase. 

Despite  the  ease  of  control  design  in  the  transformed  coordinates,  feedback  linearization 
based  approaches  have  some  serious  drawbacks: 


•  control  laws  are  state  dependent  and  usually  the  states  are  not  completely  available 
as  sensed  variables.  Hence,  the  motivation  to  use  feedback  linearization  only  in  H  as 
in  Figure  2.2  . 

•  the  effect  of  disturbances  on  the  transformations  and  designed  control  laws  cannot  be 
guaranteed  unless  certain  matching  conditions  hold  [5]  . 

•  the  nonlinear  transformations  may  not  be  valid  if  plant  parameters  change  or  are 
uncertain. 

•  in  the  control  of  flexible  systems,  plant  models  Me  often  non-minimum  phase. 

•  even  when  linear  parametric  models  Me  used  in  one  state-space  description,  switching 
to  another  state-space  description  by  nonlinear  transformations  results  in  multilineM 
pMametric  dependencies.  Hence,  in  order  to  apply  adaptation  laws  for  lineM  paramet¬ 
ric  models,  overparametrization  is  required,  i.e.,  the  number  of  adapted  pMameters 
increases. 


CleMly,  the  power  of  feedback  lineMization  is  in  directly  handling  the  system  nonlineM- 
ities.  However,  the  unavailability  of  a  full-state  measurement  hinders  the  applicability.  For 
this  reason,  we  chose  the  configuration  of  Figure  2.1  . 
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2.3  Feedforward  System  Design 


We  use  feedback  linearization  methods  in  the  design  of  the  feedforward  system  H  (see  Fig¬ 
ure  2.2)  ,  such  that: 

•  the  feedforward  system  7i  achieves  some  desired  specifications  when  disturbances  are 
zero  (nominal  design)  , 

•  the  feedforward  system  exhibits  satisfactory  performance  for  preset  bounded  but  un¬ 
known  disturbances  (perturbation  study  of  the  nominal  design)  . 

•  the  performance  of  the  nominal  design  is  robust  under  model  variations  in  a  specified 
model  set. 

In  the  above  scenario,  the  disturbances  and/or  model  variations  do  not  exist  in  7i  , 
because  it  is  a  feedforward  signal  generator.  However,  the  resulting  signal  pairs  ( u ,  y)  can 
be  designed  in  anticipation  of  such  scenarios,  thereby  reducing  the  task  of  the  regulator  to 
be  robust. 

2.3.1  Nominal  Design 

For  the  tracking  controller  design  in  H.  (Figure  2.1)  ,  consider  the  interconnection  in  Fig¬ 
ure  2.2  .  Note  that  the  internal  signals  associated  with  the  copy  of  the  model  is  available 
by  construction  (e.g.,  states  x)  .  For  the  nominal  design  part,  assume  that  the  disturbances 
d.£  ,  d„  and  dy  are  all  zero. 


Figure  2.2:  Feedforward  system 

Apart  from  the  issues  mentioned  in  Section  2.2  regarding  zero  dynamics  and  coordinate 
transformations,  there  are  also  some  inherent  limitations  due  to  the  nonlinear  nature  of  the 
control  design  problem. 

Unlike  linear  systems,  nonlinear  systems  (or  operators)  are  not  right  distributive,  i.e., 
F{x  +  y)  ^  Tx  +  Ty  .  The  impact  of  this  restriction  can  be  seen  by  examining  the  effect  of 
disturbances.  Consider  a  bounded  input  disturbance  dy  and  a  stable  nonlinear  T  ,  where 

y  =  F{u  +  dy)  . 

The  output  y  can  also  be  expressed  as 

y  =  Tu  +  dy  , 
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where 


dy  —  +  d^)  -  Tu 

If  T  is  linear,  dv  is  also  a  bounded  disturbance,  with  a  bound  depending  only  on  the  bound 
of  du  and  independent  of  the  input  signal  u  .  This  property  of  linear  stable  maps  can 
be  generalized  to  what  is  called  as  incrementally  stable  nonlinear  maps,  where  bounded 
deviations  about  input  signals  result  in  bounded  deviations  at  the  output  signals  ;  moreover, 
an  output  deviation  bound  can  be  chosen  in  terms  of  the  input  deviation  alone.  In  other 
words,  the  uniform  Lipschitz  property  in  the  algebraic  nonlinearities  is  extended  to  include 
nonlinear  dynamic  systems.  Such  an  assumption  on  the  subsystems  in  a  feedback  system 
may  indeed  be  quite  conservative.  We  now  illustrate  the  conservatism  of  imposing  Lipschitz 
conditions  in  equivalent  state-space  descriptions. 

Typically,  the  plant  model  is  expressed  in  terms  of  states  that  have  physical  meaning 
(e.g.,  position,  rate,  etc.)  .  In  the  process  of  control  design  using  feedback  linearization, 
nonlinear  coordinate  transformations  are  used  to  obtain  a  state-space  description  where  the 
control  is  easier  to  express.  Such  transformations  can  drastically  change  the  nature  of  the 
nonlinearities  in  different  state-space  descriptions.  As  an  illustrative  example,  consider  the 
map  V  from  u  to  y  described  by 

ii  =  +  f(x  i) 

X2  = 
i3  =  It 

y  =  *1  , 

where  the  algebraic  nonlinearity  /  is  Lipschitz,  i.e.,  there  exists  a  k  >  0  such  that 
\f(x  +  y)  -  f(x) |  <  k\y |  ,  for  all  x  and  y  in  1R  .  Provided  that  /  is  at  least  twice  differentiable, 
consider  the  coordinate  transformation  from  x  to  where 

X\ 

(  -  x2  +  f(xl) 

.  X3  +  x2/^1)(sl)  -I-  /(xi)/{1)(xi) 

In  terms  of  the  state-£  ,  the  same  input-output  map  can  be  expressed  as 

<1  =  6 

^2  =  i 3 

6  =  «  +  6/(1)(6)  +  tf/(2)(6) 
y  =  6  • 

Clearly,  unless  vanishes,  the  nonlinearities  in  the  (  coordinates  will  not  be  Lipschitz  . 
Hence,  when  determining  a  stabilizing  law  of  the  form 

the  choice  of  v  can  make  a  considerable  difference  when  a  bounded  perturbation  of  is  used 
instead  of  (see  Section  2.3.3)  . 

With  these  generic  limitations  at  hand,  the  following  subsections  propose  methods  to 
cope  with  unstable  zero-dynamics  and  design  stabilizing  control  laws  subject  to  bounded 
disturbances.  Again,  all  the  proposed  controllers  are  state  dependent  which  is  permissible 
only  in  the  design  of  “H  ,  the  feedforward  signal  generator. 
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2.3.2  Coping  with  Unstable  Zero-dynamics 

As  mentioned  before,  a  design  technique  based  on  plant  inversion  will  be  stabilizing  provided 
that  the  zero  dynamics  are  stable.  When  the  zero-dynamics  is  not  stable,  regardless  of  the 
relative  degree,  a  control  law  based  on  plant  inversion  cannot  be  designed.  There  are  two 
possible  ways  to  avoid  explicit  plant  inversion: 

•  Construct  an  output  different  from  the  original  output  with  a  relative  degree  of  one 
and  with  stable  zero-dynamics.  Hence,  asymptotic  tracking  of  the  original  output  can 
be  achieved  through  the  control  of  this  constructed  output.  Promising  results  based  on 
this  two  step  procedure  have  been  reported  in  aerospace  applications  where  unstable 
zero-dynamics  are  associated  with  the  acceleration  outputs.  Feedback  linearization 
was  successfully  applied  to  the  control  of  wind  angle  components  (angle-of-attack  and 
sideslip  angle).  Normal  and  lateral  accelerations  were  then  controlled  via  integral 
feedback  built  upon  the  wind-angle  controller  [16,  17]  . 

•  For  certain  classes  of  input-output  maps  with  unstable  zero  dynamics,  a  nonlinear 
tracking  controller  can  be  designed  without  going  through  transformations  involving 
higher  order  derivatives.  In  the  following,  we  describe  two  such  classes.  These  methods 
can  be  considered  as  approximate  plant  inversion. 

Consider  the  class  of  plants  as  shown  in  Figure  2.3  .  In  this  case,  the  nonlinear  system 
from  u  to  y  can  be  decomposed  as 

v  =  P2VX  , 

where  V\  is  a  nonlinear  system  but  has  minimum-phase  zero  dynamics.  The  map  P2  from  z 
to  y  is  an  LTI  map  with  unstable  transmission  zeros.  Hence,  the  zeros  of  P2  are  the  source 
of  the  unstable  zero  dynamics. 


Figure  2.3: 

By  applying  standard  input-output  linearization  to  V\  in  terms  of  z  and  its  state  xx  ,  a 
control  v  can  be  constructed  such  that  the  map  from  v  to  z  is  LTI  and  stable  (recall  that  in 
the  feedforward  system  design,  all  signals  are  available  by  construction)  .  Denote  this  stable 
transfer  function  by  Htv  .  Now  the  tracking  problem  reduces  to  that  of  designing  a  tracking 
controller  for  the  linear  system  from  v  to  y  (not  necessarily  stable),  where 

y  =  P2Hxvv  . 

Clearly,  this  LTI  problem  is  easy  to  solve. 
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As  a  second  example,  consider  the  nonlinear  system  V  shown  in  Figure  2.4  ,  where  the 
subsystem  P\  is  LTI  but  possibly  unstable  and  nonminimum-phase.  The  subsystem  Vi  is 
nonlinear  and  stable  (not  necessarily  algebraic)  . 


Figure  2.4: 


For  the  interconnection  in  Figure  2.4  ,  since  Vi  is  stable,  the  zero  dynamics  is  completely 
determined  by  that  of  the  LTI  subsystem  zeros.  Due  to  perfect  information  in  the  feedforward 
system,  the  control 

u  =  ViV  +  C{r  -y)  , 

renders  the  map  from  r  to  y  LTI,  for  any  LTI  C  that  stabilizes  \  in  the  unity-feedback 
configuration. 


2.3.3  Incorporating  Disturbance  Models  in  the  Nominal  Design 

Once  a  nominal  design  is  obtained  for  the  disturbance-free  case,  the  interconnection  in 
Figure  2.2  is  guaranteed  to  be  only  locally  stable.  FVom  an  analysis  point  of  view,  such  local 
results  may  be  satisfactory.  However,  when  the  goal  is  control  design,  subject  to  a  preset 
bounded  but  unknown  disturbance,  the  design  procedure  must  be  suitably  modified.  In  fact, 
it  may  be  necessary  to  resort  to  nonlinear  control  design  in  the  transformed  coordinates  as 
well. 

The  following  points  emphasize  some  of  the  reasons  why  the  feedforward  system  design 
is  posed  as  in  Figure  2.2  : 

•  The  feedforward  system  is  in  fact  a  benchmark  design,  since  its  performance  is  that  of 
the  ideal  case  where  the  inner-loop  controller  is  operating  on  zero  errors. 

•  During  the  implementation  of  the  controller,  errors  will  be  introduced  in  subsystem 
computations;  hence  necessarily,  the  feedforward  system  performance  should  be  guar¬ 
anteed  for  disturbances  as  shown  in  Figure  2.2  . 

•  By  guaranteeing  a  stable  system  in  Figure  2.2  that  exhibits  desired  properties  for  the 
class  of  disturbances  under  consideration,  one  can  construct  a  family  of  input-output 
pairs  ( u,y )  ,  consistent  with  the  model,  rather  than  a  single  one.  Such  a  capability 
establishes  the  first  steps  towards  robustness  under  model  uncertainty. 
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In  the  following,  we  emphasize  the  need  for  nonlinear  control  in  the  transformed  coor¬ 
dinates  when  the  design  has  to  accommodate  possibly  persistently  exciting  bounded  dis¬ 
turbances.  First,  the  class  of  plants  is  described  with  motivating  cases  that  lead  to  the 
associated  structure.  Second,  the  approach  is  described  and  illustrated  by  an  example.  Fi¬ 
nally,  the  extension  of  the  particular  approach  to  incorporate  time-variations  in  the  plant  as 
well  as  the  feedback  law  is  discussed. 

Consider  the  stabilization  of  a  perturbed  controllable  canonical  form  description: 
i  =  J +  i?[a(£  +  di)  -f  +  d2)(u  -f  <£3)]  +  d4 

y  =  t 

with  (A,B)  a  controllable  pair,  (  available,  but  the  disturbance  d  unknown,  ex^  ^  that 
it  is  bounded  with  a  known  bound,  independent  of  (  .  As  emphasized  before,  even  if  the 
nonlinearities  in  the  original  states  is  Lipschitz,  the  transformed  state-descriptions  will  almost 
always  have  non-Lipschitz  nonlinearities.  This  problem  will  arise  when 

1.  the  states  are  available,  but  there  are  measurement  and  actuation  disturbances  during 
implementation,  or 

2.  there  is  a  state-£  estimator  which  supplies  an  estimate  (  with  up  to  first  order  bounded 
state  estimation  error.  Although  the  separation  principle  does  not  hold  in  general  for 
nonlinear  systems,  such  an  estimator  can  be  used  in  a  robust  state-feedback  (hence,  a 
coupled  design)  to  yield  implementable  stabilizing  controllers. 

In  the  following,  ||  •  ||  will  denote  the  sup  norm  on  signals,  thus  allowing  persistently 
exciting  finite  average  power  signals.  In  the  above  setup,  £  will  be  available;  however, 
d  =  [df  d3  d%]T  is  not  known,  except  that  ||d||  <  7  ,  for  some  positive  known  7  . 
This  particular  plant  description  is  motivated  by  the  controllable  canonical  form  within  the 
context  of  feedback  linerizable  plants  [5]  ,  i.e.: 

x  =  Ax  +  B[a(x)  +  @(x)u]  , 

where  (A,  B)  is  a  controllable  pair;  the  algebraic  functions  a  and  denote  the  transformed 
nonlinear  system,  /3  is  nonsingular  for  x  €  IR"  .  When  state-s  is  available,  the  control  law  tt  = 
Kx— ct(x))  steers  any  initial  condition  in  IRn  to  zero  asymptotically,  provided  (A—BK) 
is  strictly  Hurwitz.  However,  such  a  control  law  will  not  be  robustly  stabilizing  when  the 
closed-loop  system  is  subject  to  unknown  but  bounded  additive  disturbances  (unless  the 
extremely  restrictive  case  where  a  and  /3  are  Lipschitz)  .  In  order  to  account  for  the  bounded 
disturbances,  the  above  feedback  linearization  based  control  law  needs  to  be  significantly 
modified. 

Approach 

To  illustrate  the  approach,  consider  a  single  input  system  in  the  model  set  described 
above  (the  following  ideas  can  be  extended  to  a  multi-input  version  with  a  diagonal  structure 
assumption  on  the  function  /3)  . 
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Due  to  controllability  on  IRn  ,  0(()  ^  0  for  any  (  .  Apply  the  control 

u  =  -r'iiMt)  +  KQ  +  v 

where  K  is  chosen  such  that  ( A  —  BK)  is  strictly  Hurwitz  ;  w  is  to  be  determined.  In  fact,  in 
the  above  control  law  decomposition,  a  can  be  replaced  with  any  function  a  provided  that 

H(&  -  «)(OII  <  7i 

for  all  (  ;  i.e.,  a  is  in  a  71  tube  about  the  function  a  .  Rearranging  terms,  one  obtains 

(  =  (A-BK)(  +  dA  +  B0(i  +  d2){  AU,d)  +  v) 

where  d  is  unknown  but  A  is  a  known  function  determined  in  terms  of  a  ,  0  ,  d  and  K  .  Let 
<f>&  be  such  that 

-Mm*)  <  A(^,d)  <  mikii,7)  , 

for  all  £  and  unknown  d  .  Note  that  such  a  bounding  function  can  be  easily  constructed 
from  the  known  A  and  the  bound  7  .  Recall  also  that  the  sign  of  0  is  (  independent.  Now 
choose  the  term  v  in  the  control  as 

v  =  -sgn/3(0)sgn(^TPB)^A(||(||,7)  , 

where  the  Lyapunov  matrix  P  satisfies 

(A  -  BK)tP  +  P(A  -  BK)  +  Q  =  0 

for  some  symmetric  positive  definite  Q  . 

Consider  the  Lyapunov  function  candidate  V  =  (TP(  ;  for  the  above  control  law,  one 
obtains 

v  <  -eQt+*epd<  . 

The  simplified  upper  bound  on  V  is  due  to  the  fact  that  the  choice  of  v  renders 

{rPB0({ +  <*,)( A({,<0  +  1.)  <  0  , 

for  all  (  and  unknown  d  .  Hence,  the  nonlinear  damping  terms  in  v  compensate  for  the 
nonlinearity  mismatches.  It  is  interesting  to  note  that  at  any  time  instant,  the  cost  V(()  is 
not  greater  than  V(tj)  ,  where 


rj  —  (A  —  BK)rj  +  d<  , 


provided  that  £(0)  =  77(0). 

Example 

Consider  the  single-input  single-output  plant 

y  =  u  +  (y  +  di)2  +  d2 
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where  y  is  supposed  to  track  a  step  reference  input  r  subject  to  unknown  disturbances  d\ 
and  dj  .  Apply  the  control 

u  =  -y3  -(y  -r)  +  v  . 

Clearly,  without  the  v  term  to  be  determined,  u  is  not  stabilizing  unless  d\  =  0  ;  any  nonzero 
mismatch  will  cause  a  finite-escape  time.  Now  choose 

v  =  — £sgn(y  -  r )(a!  +  a2\y | )  , 


where 


|d^  +  da|  <  Q!i 
2|di|  <  Qa 

The  results  guarantee  asymptotic  tracking  for  6  >  1  . 

Remarks 

The  above  approach  is  a  semi-global  stabilization  based  design;  first  determine  a  worst- 
case  bound  on  the  disturbances,  and  then  a  design  is  guaranteed  for  the  specified  bounds. 

The  approach  is  not  limited  to  the  particular  model  set;  in  fact,  it  can  be  modified 
to  include  terms  of  the  form  a(t,£,Oyd)  where  the  time-variation  and  parametric  depen¬ 
dence  to  0  are  no  longer  lumped  all  in  one  disturbance  term  d  .  Hence,  the  apptoach  can 
be  made  less  conservative  by  e.g.,  choosing  a  time- varying  bounding  function  of  the  form 
<t>&{t,  |£(t)|,  |0(t)|,7)  •  Clearly,  tighter  upper  bounds  will  yield  less  demand  on  the  actuator. 


2.4  Feedback  System  Design 

We  investigated  the  following  methods: 

•  designing  C  based  on  linearization  about  equilibria,  thereby  resulting  in  an  LTI  con¬ 
troller  . 

•  designing  C  based  on  linearization  about  the  ideal  trajectories  tx  ,  y  ,  x  from  H  .  This 
will  result  in  an  LTV  controller  (see  Section  2.4.1)  . 

•  designing  C  based  on  a  Lyapunov  approach.  This  will  result  in  a  nonlinear  controller 
(see  Section  2.3.3)  . 

Note  that  all  of  these  methods  can  be  used  in  the  design  of  H  as  well.  But  the  full-state 
methods  used  to  design  H  cannot  be  used  to  design  C  .  It  is  important  to  emphasize  that  all 
the  signals  in  H  are  constructed  ,  whereas  C  must  be  only  constructed  on  sensed  variables. 
Thus,  the  state  of  V  is  not  available  to  design  C  .  But  the  states  of  a  model  of  V  are  available 
to  design  H  (and  also  C)  . 

One  of  the  advantages  of  the  proposed  feedback  linearization  based  design  of  the  feedfor¬ 
ward  system  is  that  the  scheduling  signal  z  is  completely  known.  It  is  constructed  ,  on  line, 
in  'H  ,  the  feedforward  system  (see  Figure  2.2)  . 
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As  an  illustration,  consider  a  nonlinear  system  of  the  form 

x  =  /(x)  4-  Bu 
y  =  Cx  , 

where  /(•)  is  a  nonlinear  algebraic  function  and  B  and  C  are  constant  matrices.  Following 
the  approaches  in  Section  2.3  ,  suppose  that  the  associated  tracking  problem  is  solved.  In 
other  words,  the  feedforward  system  in  Figure  2.1  is  designed  so  that 

x  =  /(x)  -|-  Bu 
y  =  Cx 

where  y  exhibits  the  desired  tracking  properties.  Note  that,  all  of  the  variables  associated 
with  the  feedforward  system  (i.e.,  the  '*  versions)  are  available  by  construction. 

Now  consider  a  first  order  approximation  to  the  error  system  described  in  terms  of 
e*  =  x  —  *,ey  =  y—  y  and  eu  =  u  —  tx  .  The  resulting  linear  time-varying  system 
can  be  expressed  as 

ex  =  A(x)e*  +  Beu 
tv  =  Ce* 

where  A{x)  =  §f(x)  is  the  Jacobian  of  /  evaluated  at  x  .  Because  x  is  constructed  in  the 
feedforward  block  H  ,  A{x)  is  a  known  time-varying  matrix.  By  construction  x  is  bounded. 
Typically,  x  will  have  the  decomposition 

x  -  Xtr  +  x„ 

where  x(r  and  x„  denote  the  transient  and  steady-state  components.  If  the  steady-state 
component  is  constant,  one  can  design  an  LTI  controller  for  the  equilibrium  LTI  description. 
If  the  steady-state  component  is  periodic  and  slowly- varying,  averaging  can  be  applied  to  de¬ 
sign  time-invariant  controllers.  In  the  following  section,  we  propose  a  time- varying  controller 
design  approach. 

2.4.1  Time- varying  Controller  Design 

Consider  the  unity-feedback  configuration  in  Figure  2.5  .  In  the  rest  of  this  section  we 
will  focus  on  the  case  where  P  is  linear-time  varying  (LTV)  .  An  LTV  controller  C  will  be 
designed  so  that  the  closed-loop  is  stable.  Moreover,  the  controller  can  be  modified  (adapted) 
online  to  improve  the  nominal  performance. 

Suppose  that  there  exists  an  LTI  controller  Co  that  stabilizes  the  unknown  LTV  plant 
P  .  Using  stable  factor  factorizations  [18]  ,  let  Co 

Co  =  ND~l  =  D~lN 

with  the  associated  Bezout  identity 
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where  ail  of  the  eight  maps  are  LTI  and  stable.  Under  these  conditions,  all  LTV  plants  that 
are  stabilized  by  Co  admit  a  special  factorization: 


p  =  d;'np 
D,  =  V-Qpn 
Np  =  U  +  QPD  , 

where  Qp  is  an  unknown  but  stable  LTV  system.  Now,  choose  an  LTV  controller  C  =  NCD ~l, 
where 


u-u  =  Nc(  =  ( N  +  VQe)( 

y-y  =  DU  =  (D-UQ'K 

Qc  is  an  LTV  stable  map  to  be  chosen,  and  (  is  an  available  signal  by  construction  of  C  . 


u 


Figure  2.5: 

Now,  for  the  specified  C  and  P  ,  the  system  in  Figure  2.5  is  stable  if  and  only  if  the 
closed- loop  map  from  (tz,  y)  to  £  is  stable.  Equivalently, 

DPDC  +  NPNC  =  I  +  QPQC 

has  a  stable  inverse.  Now  suppose  that  the  unknown  Qp  has  a  known  gain  bound  (e.g.,  an 
L2- gain)  denoted  by  ||QP||  <7  .  Choose  k  LTI  stable  Qd’a  such  that, 

n«»ii  <  1 
<  1  h 

»=1 

q .  =  • 

»=i 

By  construction,  the  gain  of  QpQe  is  less  than  unity.  Hence,  by  the  small  gain  theorem, 
(I+QpQc)  has  a  stable  inverse.  In  other  words,  for  any  choice  of  A  that  satisfies  the  inequality 
constraint  above,  the  resulting  controller  will  stabilize  the  unknown  P  . 

The  unity-feedback  system  interconnection  also  constrains  the  set  of  admissible  X  values. 
Writing  the  summing  node  equations  in  Figure  2.5  ,  we  obtain 

(Vy-Uu-0  =  QP{Ny  +  Du  +  Qc()  . 
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Note  that,  the  above  consistency  equation  is  of  the  form  rji  =  Qpt)2  ,  where  rj  is  known,  Qp  is 
unknown  but  a  bound  on  its  gain  is  available.  In  the  case  of  predetermined  L2-gain  bound 
on  Qp  ,  the  consistency  equation  translates  into  inequality  constraints  on  truncated  norms 
of  tj  [13]  .  Since  the  controller  parameter  Qc  is  linear  in  A  ,  this  in  turn  translates  into  a 
quadratic  constraint  on  A  .  This  quadratic  constraint,  together  with  the  constraint  A,  < 
I/7  ensure  that  the  choice  of  A  realizes  a  stabilizing  controller  and  is  consistent  with  the 
interconnection  equations.  Now  an  additional  performance  criterion,  e.g.,  the  tracking  error 
Dc£  can  be  monitored  for  improvement  (the  tracking  error  Dc{  is  affine  in  the  parameter 
A). 

2.5  Adaptation 

Consider  the  generic  nonlinear  plant  model  shown  in  Figure  2.6  ,  where  G  denotes  the  known 
linear  part  and  T  denotes  the  algebraic  nonlinearity  (e.g.,  friction,  saturation)  which  depends 
on  an  unknown  parameter  6  .  The  signals  z  and  v  at  the  input  and  output  of  the  nonlinearity 
are  unavailable  as  measurements.  However,  for  the  design  of  H  in  Figure  2.1  ,  all  signals 
will  be  available  by  construction,  except  that  6  will  be  replaced  by  the  estimate  6  . 

y 


Figure  2.6:  A  parametric  nonlinear  plant  model  V  :  (u,  9)  y  . 

Once  the  design  of  Ti  is  based  on  6  ,  the  interconnection  in  Figure  2.1  can  be  modified 
to  that  shown  in  Figure  2.7  .  The  motivation  behind  the  closed-loop  system  in  Figure  2.7  is 
that  for  sufficiently  slow  adaptation,  the  performance  of  the  inherent  model  follower  tracking 
design  will  be  recovered  (e.g.,  [1])  . 

2.5.1  Application  to  Adaptive  Testbed 

The  Adaptive  Testbed  apparatus  at  ARDEC  poses  a  particularly  interesting  nonlinear  con¬ 
trol  design  problem  suitable  for  the  cascade  implementation  in  Figure  2.1,  where: 

1.  the  states  are  not  available  as  measurements, 

2.  the  model  exhibits  non-minimum  phase  zero  dynamics 

3.  the  goal  is  tracking  performance  improvement 

4.  an  analytical  model  is  available  as  in  Figure  2.6  , 
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Figure  2.7:  Adaptation  of  the  cascade  system  in  Figure  2.1 

5.  candidate  designs  can  be  easily  tested  out  in  real-time. 

Except  for  the  discrete-controller  implementation  stage,  all  of  these  points  coincide  with  the 
issues  addressed  in  this  research.  For  the  preliminary  studies  on  the  analytical  model  of  the 
testbed,  see  Appendix  C  . 


2.6  Towards  a  Nonlinear  Control  Design  Based  on  Measured 
Variables 

Within  the  context  of  finite-dimensional  linear  time-invariant  (LTI)  feedback  interconnec¬ 
tions  and  LTI  plants,  full-order  estimated  state-feedback  based  design  approaches  are  widely 
used  in  control  design  problems.  After  suitable  augmentations,  the  design  problem  is  reduced 

r  A  B  1 

(can  be  taken  as  strictly  proper,  without  any  loss  of  gen- 


to  the  stabilization  of 


C  0 


—  LC)  are  both  strictly 
The  relative  ease  of 


erality)  subject  to  a  particular  cost  criterion  (e.g.,  Hi  and/or  Tf*,).  The  analytical  and/or 
numerical  solutions  to  the  associated  optimization  problem  yields  a  state-feedback  gain  K 
and  an  output-injection  gain  L  ,  where  the  matrices  ( A  —  BK )  and  {A 

Hurwitz.  The  resulting  controller  is  given  by  ^  L 

—K  0 

solutions  associated  with  such  design  problems  stems  from  the  inherent  decomposition  into 
two  sub-problems  which  can  be  dealt  with  separately.  The  estimation  of  the  state  and  the 
control  law  based  on  the  ideal  state  are  solved  separately,  and  then  the  estimated  state  is 
used  instead  of  the  true  plant  states  in  the  control  law.  This  two-step  approach  is  referred 
to  as  the  separation  principle. 

Before  proceeding  with  nonlinear  control  design  methods  and  why  the  separation  principle 
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fails  in  general,  it  is  useful  to  observe  the  inherent  properties  that  make  the  separation 
principle  work  in  the  LTI  setting.  Consider  the  plant  P  :  u  h-»  y  , 

x  =  Ax  +  Bu 
y  =  Cx 

with  a  stabilizable  pair  ( A,B )  and  a  detectable  pair  (C,  A)  .  If  the  plant  state  x  were 
available,  a  control  of  the  form  u  =  —Kx  would  solve  the  stabilization  problem.  Since 
x  is  not  available,  an  estimate  x  is  constructed  using  (u,y)  or  perturbed  versions  thereof. 
Consider  the  estimator  E  :  (ue,ye)  •— »  x  , 

x  =  (A  —  LC)x  +  Bu,  +  Lye  , 

where 

ue  +  du  =  u 
y  +  dy  =  j/e 

where  the  exogenous  inputs  d^  and  dy  denote  the  unmeasured  but  bounded  actuator  find 
sensor  disturbances,  respectively.  Letting  ex  :=  x  — x  ,  and  using  the  estimates  in  the  control, 
i.e.,  ue  =  —Kx  ,  the  closed-loop  system  is  described  by 

x  =  (A  —  BK)x  +  BKex  +  Bdu 
e*  =  (A  —  LC)ex  -|-  Bdu  —  Ldy  . 

The  properties  that  make  the  separation  principle  work,  can  be  listed  as  follows: 

1.  For  bounded  (du,dy)  ,  ex  is  bounded.  When  (d^jdy)  =  0  ,  ex  — »  0  . 

2.  For  bounded  estimation  error  e*  ,  x  is  bounded.  As  ex  — ♦  0  ,  x  — +  0  . 

Due  to  possibly  persistently  exciting  disturbance  (du,dy)  ;  ex  is  not  expected  to  go  to 
zero.  Hence,  the  state-feedback  (in  terms  of  x)  must  be  robust  to  perturbations  ex  which 
does  not  necessarily  correspond  to  initial  conditions  mismatches  in  P  and  E  . 

For  the  sake  of  argument,  suppose  that  one  indeed  has  an  asymptotic  observer  and  a 
“globally”  asymptotically  stabilizing  full-state  feedback  law.  Intuitively,  for  “small”  errors 
in  the  state-estimate,  the  separation  principle  based  control  design  will  be  stabilizing.  Thus, 
after  waiting  “sufficiently  long”  for  reliably  reconstructing  the  states,  one  can  use  the  esti¬ 
mated  states  instead  of  the  true  states  in  the  control.  Apart  from  the  nontrivial  assumption 
that  such  an  asymptotic  observer  is  available,  another  pitfall  in  such  an  argument  is  that 
one  may  not  be  able  to  wait  without  a  nominal  bounded-input  bounded-output  (BIBO)- 
stabilizing  loop,  since  the  plant  may  blow  up  before  estimation  errors  are  sufficiently  small, 
(e.g.,  x  =  xa  +  u,  u  =  0,  x(0)  =  x0  >  0,  x(t)  =  •  The  existence  of  such  a  nominal 

stabilizing  loop  is  the  same  as  requiring  a  control  that  robustly  BIBO-stabilizes  the  plant, 
although  the  desired  performance  may  not  be  achieved. 

Imposing  the  algebraic  nonlinearities  in  the  ordinary  differential  equations  to  be  globally 
Lipschitz  in  order  to  extend  the  appealing  properties  of  linear  vector  fields,  may  be  quite 
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restrictive.  Nonlinearities  in  one  coordinate  system  may  be  Lipschitz,  whereas  in  an  another 
coordinate  system  they  may  not  be  (recall  the  discussion  on  page  10)  . 

For  a  given  nonlinear  plant  V  :  u  *-*  y  ,  consider  the  tracking  problem:  for  a  particular 
class  of  reference  signals  r  ,  determine  a  control 

u  =  £(r,y) 

such  that  y  — ►  r  and  for  a  reasonable  class  of  bounded  disturbances,  under  the  perturbed 
control 

u  =  du  +  K(r,y  +  dy) 

the  closed-loop  signals  remain  bounded.  In  other  words,  the  closed-loop  (see  Figure  2.8)  is 
BIBO-stable  (in  the  sense  of  a  particular  extended  space)  and  achieves  the  desired  tracking 
when  disturbances  are  zero. 

dy 

Figure  2.8:  Desired  dynamic  controller  K 

We  will  focus  on  a  particular  class  of  single-input  nonlinear  plants  which  admit  a  finite¬ 
dimensional  state-space  description  of  the  form 

x  =  /( i)  +  g{x)u 

for  which  the  State-Space  Exact  Linearization  Problem  [5]  is  solvable  over  Ht",  i.e.,  control¬ 
lability  and  involutivity  conditions  hold  in  IR"  .  For  such  plants,  when  states  are  available, 
the  origin  can  be  rendered  globally  asymptotically  stable.  Moreover,  when  x  is  available  for 
feedback,  the  asymptotic  tracking  problem  can  be  solved  for  any  output  that  does  not  intro¬ 
duce  a  nonminimum-phase  zero  dynamics.  In  the  case  where  the  output  y  to  track  r  does 
introduce  a  nonminimum-phase  zero  dynamics,  one  may  be  required  to  introduce  dynamic 
augmentation  prior  to  full-state  feedback.  Such  strong  global  results  do  rely  on  perfect  state 
measurement  and  render  controller  candidates  as  shown  in  Figure  2.9  . 


Figure  2.9:  Solvable  design  problem,  dynamic  controller  Co;  dx  =  0,  dy  =  0 

Taking  into  account  the  control  law  in  Figure  2.8  and  the  fact  that  one  typically  encoun¬ 
ters  number  of  sensors  much  less  than  the  number  of  states,  one  might  dismiss  the  plant 
state  based  control  laws  on  the  grounds  that  they  are  not  implementable.  As  it  is  motivated 
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by  the  separation  principle  based  control  design,  for  classes  of  plants  that  admit  asymptotic 
observers  or  in  the  case  of  bounded  state-estimation  errors,  the  stability  properties  of  the 
control  law  with  y  =  x  in  Figure  2.8  is  of  major  importance  (see  also  Figure  2.11). 

Motivated  by  the  separation  principle  based  control  design,  a  design  approach  can  be 
taken  as  follows: 

•  Assumption  :  Imposing  stability  properties  on  (u,y)  is  equivalent  to  imposing  them 
on  x  .  The  plant  V  :  u  t— >  y  does  not  have  any  “hidden  modes” .  The  input  output  pair 
(u,y)  completely  characterizes  the  internal  dynamics. 

•  Problem  I  :  An  estimator  that  allows  the  decoupling  of  the  design  problem  (see 
Figure  2.10).  Recall  from  the  motivating  LTI  setting,  that  the  role  of  the  estimator 
is  to  guarantee  that  for  all  bounded  du,  bounded  dy  and  possibly  unbounded  ue  ,  the 
error  ( x  —  x)  is  bounded.  Moreover,  when  dt^  =  0  and  dy  =  0  ,  ex  — ►  0  .  In  other  words, 
for  the  interconnection  in  Figure  2.10  ,  the  error  ex  must  be  uncontrollable  from  u  and 
the  bound  on  ex  must  depend  only  on  the  bound  on  the  exogenous  variable  dy  .  This 
property  is  the  crux  of  the  separation  principle,  since  it  simplifies  the  design  problem 
to  Problem  II  . 


u 


A 


EST  *0*-dy 

IT 

Figure  2.10:  Problem  I;  estimator  with  (x  —  2)  G  L «,  for  all  u  in  and  dv  in  Le 


•  Problem  II  :  A  stabilizing  state-feedback  law.  For  bounded  dx,  d^  and  dy,  determine 
a  controller  C\  for  which  the  closed-loop  signals  remain  bounded.  Moreover,  when 
d«  =  0  and  dy  =  0,asdx— *  0  ,  y  — »  r  .  Note  that  this  step  is  nontrivial  since 
the  controller  C0  in  Figure  2.9  is  typically  based  on  d*  =  0  and  dy  =  0  .  The  local 
stability  properties  of  the  design  in  Figure  2.9  may  not  suffice  for  the  specified  bound 
on  dx  obtained  from  Problem  I.  With  a  slight  abuse  of  notation,  from  now  on  we  will 
refer  to  C\  in  Figure  2.11  as  a  robust  controller,  since  its  design  does  take  into  account 
the  predetermined  bounds  on  dx  and  dy  .  The  abuse  is  due  to  the  fact  that  in  the 
LTI  setting,  robustness  is  attributed  to  closed-loop  signal  dependent  perturbations, 
since  exogenous  additive  disturbances  cannot  drive  a  stable  LTI  loop  unstable.  Since 
local  and  global  stability  results  merge  in  the  LTI  setting  and  global  stability  results 
are  too  restrictive  in  the  nonlinear  setting,  perhaps  a  better  suited  description  for  the 
controller  C\  in  Figure  2.11  is  semi-global  ;  i.e.,  for  the  given  bounds  on  du,dx  and  dy 
the  closed-loop  signals  remain  bounded. 

•  Merging  Solutions  to  Problems  I  and  II  :  Recall  that  the  solution  to  Problem  I 
guarantees  that  bounded  deviations  about  possibly  unbounded  input  output  pairs  of 
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V  result  in  bounded  ex  .  Hence  one  can  incorporate  Figure  2.10  in  Figure  2.11  as  the 
bounded  disturbance  dx  generator  to  obtain  Figure  2.12  . 


Figure  2.12:  Candidate  controller  K 


Note  that,  in  the  case  that  the  estimator  in  Problem  I  satisfies  the  property  that  e*  — »  0 
for  du  —  0  and  dy  =  0  ,  that  is  when  x  is  asymptotically  reconstructed,  the  choice  of  the 
robust  controller  C\  in  Problem  II  guarantees  that  the  closed-loop  signals  remain  bounded 
due  to  bounded  dx  ;  in  addition,  the  nominal  design  constraint  on  C\  ensures  that  y  — *  r  as 
dx  -»  0  . 

As  a  preliminary  step  towards  control  design  based  on  state-estimates,  the  following  two 
studies  show  the  importance  of  the  choice  of  the  nominal  globally  stabilizing  (or  tracking) 
control  law  based  on  perfect  state  measurements. 

In  the  first  study,  set-point  asymptotic  tracking  problem  is  solved  for  an  output  that 
exhibits  nonminimum-phase  zero  dynamics.  Using  dynamic  augmentation  and  then  applying 
state-feedback  asymptotic  tracking  problem  is  solved.  In  other  words,  the  controller  Co  in 
Figure  2.9  is  obtained.  The  derivations  emphasize  that  the  standard  Lipschitz  constraints, 
for  ease  of  Lyapunov  based  derivations,  on  the  resulting  vector  fields  after  possible  changes 
in  coordinates  is  too  restrictive. 

In  the  second  study,  stability  properties  of  the  origin  is  investigated  when  the  feedback  law 
is  based  on  estimates  from  an  asymptotic  observer.  It  is  shown  that  the  globally  stabilizing 
exact  feedback  linearization  based  controller  Co  in  Figure  2.9  is  no  longer  globally  stabilizing 
in  Figure  2.10  .  A  globally  stabilizing  controller  C\  that  solves  the  Problem  II  is  constructed. 
Using  the  asymptotic  observer,  the  controller  K  in  Figure  2.12  is  obtained. 
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2.6.1  Example  1:  Tracking  and  Nonminimum-Phase  Zero  Dynamics 


In  order  to  illustrate  a  strictly  causal  plant  with  a  one- dimensional  possibly  nonminimum- 
phase  zero  dynamics,  consider  the  following  two-state  u  to  y  map. 

®i  =  x2  +  /(*  1) 

®2  =  4(aja  —  ®i)  +  u 

y  =  xj  ®i 

Note  that  when  /  =  0  ,  the  transfer  function  from  u  to  y  is  ■ 

It  is  easy  to  see  that  a  tracking  control  based  on  the  coordinate  transformation  using 
derivatives  of  y  will  not  be  stabilizing.  Note  that 


y  =  4y  +  it  -  x3  -  /(®i) 


For  f  =  0  , 
results  in 

hence  for  any  a  >  0  ,  y 


u  =  -4y  +  x2  +  /(®i)  +  a(r  -  y) 

(y-r)  =  -ot(y-r)  ; 
r  .  However,  the  asymptotic  tracking  constraint 

®2  =  *i  +  y(0)e-Q‘ 


renders 

ii  =  Xi  +  /(®i)  +  r  +  (y(0)  -  r)e~at  . 

Take  for  example,  r  =  0,  /(•)  =  (-)3  .  Clearly,  in  the  limit 

®i  =  ®i  +  x\  , 

and  the  states  are  unbounded. 

When  /  is  zero,  it  is  well  known  that  step  inputs  can  be  asymptotically  tracked  if  and 
only  if  plant  and/or  the  compensator  have  at  least  one  pole  at  zero.  Since  does  not 
have  a  pole  at  zero,  the  standard  LTI  design  procedure  would  involve  augmentation  with 
an  integrator.  Proceed  with  the  same  augmentation  for  the  nonlinear  plant  at  hand  (see 
Figure  2.13)  . 


r 


Figure  2.13:  Augmented  plant 


If  i  were  available,  since  t]  is  part  of  the  controller,  the  augmented  plant  states  are  avail¬ 
able.  If  one  can  find  a  relative  degree  three  output,  i.e.,  solve  the  state- feedback  linearization 
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problem,  then  rj  — ►  0  would  imply  that  y  —*  r  .  This  requires  that  the  augmented  plant 
dynamics 

y  =  r  -  x2  +  ®i 

±1  =  *a  +  /(*i) 
x2  =  4(x2  —  Xi)  +  tt 

satisfies  the  controllability  and  involutivity  conditions.  Both  conditions  are  satisfied  provided 
that 

/(1)(* i)  ±  -1 

which  we  will  assume  for  global  results.  It  is  interesting  to  note  that  this  condition  is  in  fact 
a  generalization  to  the  hidden  mode  concept  in  the  LTI  setting,  i.e.,  if  /(x i)  =  — xj  ,  the 
linear  plant  from  tt  to  y  would  have  a  hidden  unstable  mode. 

Now  consider  the  relative  degree  three  output 


z  =  T)  +  x  i 

motivated  by  the  /  =  0  case.  Note  that 

=  r  +  xi  +  /(xj) 

2(2)  =  (1  +  f^\xi))(Xj  +  f(Xj)) 

PM 

and  recall  that  the  (global)  controllability  condition  is  equivalent  to  /3(xi)  /  0  for  all 
Xi  €  IR  ■  Hence,  under  the  coordinate  transformation 


t  = 


zW 

*<a) 


the  augmented  plant  dynamics  can  be  put  into  the  controllable  canonical  form 

(}  =  6 

6  =  6 

6  =  /(2)-(a:2  +  /)2+/?(si)(u  +  4(x2-x1)  +  /(1)-(x2+/)) 

a(x)  +  /3(xi)u 

where  dependencies  on  xi  Me  suppressed  in  the  /  terms,  for  the  sake  of  brevity.  For  any  rpi, 
■02  and  fa  such  that  the  polynomial 

s3  -f-  fas2  +  fas  +  fa 


is  strictly  Hurwitz,  the  control  u  determined  by 

a(x)  +  0{xi)u  =  fa(i  -  fab  -  fa(3 
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renders  (  =  0  globally  asymptotically  stable  (recall  that  the  transformation  to  {  coordinates 
is  one-to-one  and  onto)  .  Hence 


0  =  r  +  xi  +  /(xi) 

0  =  0(x1)(xa  +  /(x1)) 


and  0(x\)  ^  0  imply  that 


r  +  xi  —  xa  — »  0 


i.e.,  y  — ►  r  .  Note  that  the  above  exact  cancellation  based  scheme  is  not  the  only  way  to 
stabilize  the  augmented  plant  in  the  (  coordinates.  In  fact,  as  we  will  see  later  on,  such 
an  exact  cancellation  may  make  the  global  nature  of  the  nominal  result  not  robust,  i.e., 
perturbations  in  the  states  (other  than  initial  conditions)  may  render  the  result  only  locally 
stable.  Hence,  it  is  crucial  that  one  studies  the  sensitivity  of  the  full-state  based  control 
design  and  modify  it  if  possible  so  that  for  the  class  of  anticipated  disturbances,  the  local 
nature  of  the  control  law  is  satisfactory. 


2.6.2  Making  the  Separation  Principle  Work 

Consider  the  plant 

x  =  Ax  +  Bu  -I-  F(y) 
y  =  Cx 

with  (A,  B,  C)  a  minimal  triple.  Suppose  that  there  exists  a  nonlinear  coordinate  transfor¬ 
mation 

i  =  ♦(*) 

such  that  the  plant  can  be  equivalently  represented  as 

(  =  X(  +  B(u  +  q({)) 

V  =  c*-‘(0 

For  such  a  system,  asymptotic  state  construction  is  easier  in  the  x-coordinates,  since 

x  =  Ax  +  Bu  +  F(y)  +  L{ y  -  y) 
y  =  Cx 

yields  an  estimation  error  e,  :=  x  —  x  ,  where 

ex  =  (A  -  LC)ex  . 

The  stabilizing  control  is  easier  to  express  in  the  ^-coordinates  since 

u  =  -Kt-  a( 0 

renders  £  =  0  globally  asymptotically  stable  since  the  above  control  yields 

(  =  ( A-BK)i  . 


26 


The  heuristic  step  of  replacing  (  by  $(s)  can  only  guarantee  local  results.  For  “sufficiently" 
small  ex  ,  the  estimated-state  feedback  will  be  locally  stabilizing.  The  crucial  points  are  how 
large  the  basin-of-attraction  is  and  if  it  is  possible  to  enlarge  it  to  yield  a  satisfactory  design. 

Let  (  $(x)  .  The  closed-loop  dynamics  under  the  feedback  law 

u  = 

can  be  expressed  as 


f 


e 


X 


(A  -  BK)(  + 
(A  -  LC)ex 


dx 


*=*-'(?) 


M(i) 


(  LCu  +  F(C*-‘(()  +  e.))  -  F(C»-'(Q) ) 
0((,  e.) 


Even  if  F  is  Lipschitz  (hence  es)||  depends  only  on  ||ex||),  M(f)  need  not  be  Lips- 
chitz.  Hence  establishing  and  modifying  the  baain-of-attraction  are  nontrivial  design  tasks. 
However,  the  crucial  observation  is  that  the  stabilization  problem  is  in  fact  the  robustness 
of  the  full-state  feedback  law  subject  to  Li  disturbances. 

The  following  example  illustrates  the  need  for  a  robust  state-feedback  law  and  the  sensi¬ 
tivity  of  the  exact-linearization  based  control  law. 


2.6.3  Example  2 

Consider  the  following  one  state  plant  model  V  :  u  •— ►  y 

y  =  y3  +  « 

where  the  goal  is  to  render  y  =  0  globally  asymptotically  stable.  The  perfect  state  (namely 
the  plant  output)  is  not  available  due  to  an  unknown  output  disturbance  d,  where  d  G  Li 
and  d  G  Loo  •  The  fact  that  d  — ►  0  motivates  the  control  law  candidate 

u  =  -y3  -  ay 

with  a  >  0  .  We  show  that  such  an  exact  cancellation  based  control  law  yields  only  locally 
stable  results,  whereas  choosing  a  nonlinear  control  law  for  the  linearized  plant  significantly 
improves  the  results. 

The  motivation  behind  the  choice  of  d  above  is  possible  asymptotic  estimation  error 
subject  to  initial  condition  mismatches.  The  over-simplicity  of  this  one-state  plant  should 
not  be  misleading,  since  even  if  y(0)  is  known  perfectly,  the  estimator  built  for  plants  of  the 
form  x  =  Ax  +  Bu+F(y)  with  y  available,  will  not  necessarily  guarantee  zero  state-estimation 
error.  In  fact,  if  one  were  to  build  an  estimator 

y  =  u  +  y3  +  (y  -  y) 
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Figure  2.14:  Control  under  perturbed  state  measurement 

and  base  the  control  law  on  y ,  by  setting  d  =  y  —  y  ,  one  obtains 

d  =  -d  (2.1) 

which  implies  the  L2  disturbance  in  the  plant  states  (see  Figure  2.14). 

Consider  the  controller  of  the  form 

C(a,0)  :  y  t-»  u  ,  u  =  —ay  -  0y*  . 

for  a  >  0  and  0  >  1  and  the  disturbance  d  as  in  (2.1).  The  dynamics  of  the  resulting 
closed- loop  system  in  Figure  2.14  is  described  by 

y  =  — a(y  +  d)  +  y3  —  0(y  +  d)3  (2.2) 

d  =  -d  (2.1) 


Without  getting  into  extensive  simulations  to  obtain  the  phase-portraits  for  different  a 
and  0  values,  it  is  useful  to  derive  some  qualitative  properties  of  (2.2)  and  (2.1)  analytically. 

For  a  >  0  and  0  >  1  ,  (y,d)  =  (0,0)  is  the  only  equilibrium  point.  The  Jacobian  at  (0,0) 
has  eigenvalues  —a  and  —1  ,  with  associated  eigenvectors  (y,0)  and  (y,  ~^y)  ,  respectively. 
Since  d(t)  cannot  change  sign,  the  phase  portrait  can  be  decomposed  into  three  invariant 
sets:  d  >  0  ,  d  =  0  and  d  <  0  ;  i.e.,  the  trajectories  remain  exclusively  in  the  set  that  the 
initial  condition  belongs  to.  There  is  an  odd  symmetry  in  the  flows  since  (y,  d)  and  (— y,  —d) 
satisfy  the  same  differential  equations.  Hence,  it  suffices  to  consider  the  flow  for  d  >  0  since 
d  =  0  case  is  asymptotically  stable  for  all  y(0)  . 

The  purpose  of  the  exercise  is  the  choice  of  the  nonlinear  control  determined  by  0  ; 
although  the  basin-of-attraction  will  be  affected  by  the  choice  of  a  ,  the  qualitative  properties 
for  0  =  1  and  0  >  1  are  generic.  In  the  following,  we  will  consider  the  case  where  a  =  1, 
(i.e.,  the  eigenvectors  are  colinear).  Let  the  Lyapunov  function  candidate  be  V  =  |(y2  +  d3) . 
Evaluating  V  along  the  solutions  of  (2.2)  and  (2.1)  ,  we  obtain 

V  =  (1  -  0)y*  -  ya  -  (1  +  3/SyV  -  yd(l  -I-  30y2  +  0d2)  .  (2.3) 

From  (2.3),  one  can  easily  deduce  the  following: 

•  0  >  1  V  is  eventually  negative,  due  to  the  dominating  first  term  in  (2.3)  .  Hence,  y 
remains  bounded  for  any  (y(0),d(0))  .  Once  d  gets  sufficiently  small  ,  y  — ►  0  . 
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•  0  —  1  The  first  term  on  the  right  hand  side  in  (2.3)  is  zero.  The  last  term  changes 
sign  according  to  (y,  d)  in  the  first  or  second  quadrant.  Clearly,  for  any  (y(0),d(0))  in 
the  first  quadrant,  the  volume  V  decreases,  however,  the  trajectories  might  continue 
into  the  second  quadrant,  where  the  last  term  becomes  positive.  In  fact,  for  sufficiently 
large  d(0)  ,  the  system  exhibits  finite  escape-time. 
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Abstract 

The  robustness  to  parameter  mismatch  in  feedback 
linearization  based  nonlinear  tracking  systems  is  inves¬ 
tigated.  The  certainty-equivalence  principle  gives  rise 
to  four  possible  feedback  configurations:  One  is  the 
widely  used  case,  where  the  states  are  assumed  to  be 
available;  two  others  are  observer-based;  the  last  one 
is  a  model-follower  based  on  a  feedforward /feedback 
implementation  using  a  signal  generator.  It  is  noted 
that  the  unacceptable  transient  behaviour  of  the  adap¬ 
tive  tracking  scheme  is  closely  related  to  the  sensitivity 
of  the  underlying  certainty-equivalence  based  control, 
which  is  analyzed  through  a  perturbation  approach. 
Simulations  are  performed  on  an  example  to  illustrate 
the  points. 

i  1  Introduction 

Consider  the  following  nonlinear  tracking  problem: 


•  For  a  given  nonlinear  plant  with  state-space 
scription  .  =  f{xu0) 

0  =  0 


y  =  h(x,u,0)  , 

and  a  given  constant  reference  signal  r  ,  deter¬ 
mine  a  feedback  law  u  =  C(y,r)  such  that 
the  closed-loop  system  is  internally  stable  and  the 
output  y  asymptotically  tracks  r  . 

The  parameter  0  is  clearly  uncontrollable.  The  de¬ 
sired  control  law  has  to  be  expressed  in  terms  of  the 
measured  plant  variables  ;  such  control  laws  will  be 
classified  as  implementable  .  The  goal  is  in  fact  a  ro¬ 
bust  tracking  controller,  since  performance  is  achieved 
despite  the  uncertainty  in  the  uncontrollable  variable 
0  . 


When  0  is  exactly  known,  the  tracking  problem  has 
been  solved  for  special  classes  of  plants  that  can  be  ren¬ 
dered  linear  under  algebraic  state-feedback  and  change 
of  coordinates  (e.g.,  [Isi  1]  and  Section  II  of  [Sasljl  . 

When  0  is  unknown,  the  widely  used  approach  to 
design  an  adaptive  tracking  scheme  is  the  certainty- 
equivalence  principle:  the  tracking  problem  is  solved 
as  if  0  is  known  and  then  the  above  control  is  based  on 
the  estimate  0  (see  [Sasl]  and  the  references  therein)  . 

There  are  two  problems  with  the  above  approaches. 
First,  the  states  must  all  be  sensed  variables,  which  is 
often  not  the  case;  hence  the  controllers  are  not  imple¬ 
mentable  in  our  sense.  Secondly,  certainty-eauivalence 
based  designs  are  not  guaranteed  to  be  rooust,  and 
hence,  the  adaptive  tracking  performance  may  be  un¬ 
acceptable  due  to  poor  transient  behaviour,  a  phe¬ 
nomenon  well  known  in  the  adaptive  linear  control 
(e  g.,  [Andl]) 

‘Research  supported  by  AFOSR,  Directorate  of 
Aerospace  Sciences  under  contract  F49620-90-C-0064 
and  ARO,  Engineering  Sciences  Division  under  contract 
DAAL03-91-C-0011. 


A  solution  to  the  implementability  problem  is  to 
use  an  estimate  of  the  state  in  the  controller.  Un¬ 
fortunately,  there  is  no  dual  theory  of  observer  design 
for  feedback  linearizable  systems.  Adaptive  controllers 
based  on  observers  have  been  obtained  in  the  litera¬ 
ture  for  specific  classes  of  nonlinear  plants  (e.g.,  under 
output-matching  conditions  in  [Kanl])  . 

Addressing  the  second  problem  requires  designing  a 
robust  nonlinear  tracking  controller  before  any  adap¬ 
tation  is  introduced.  Hence,  prior  information  on  the 
plant  description  and  the  parameter  uncertainty  are 
crucial  to  improve  the  transient  behaviour. 

In  this  paper  we  consider  a  special  class  of  nonlin¬ 
ear  plants.  A  direct  interpretation  of  the  certainty- 
equivalence  principle  gives  rise  to  four  possible  feed¬ 
back  configurations. 

The  first  one  is  the  widely  used  case,  where  the 
states  are  assumed  to  be  available.  Despite  the  un¬ 
derlying  implementability  problem,  we  address  the  is¬ 
sue  of  sensitivity  through  a  perturbation  analysis  of 
the  approach.  Simulations  are  performed  on  an  ex¬ 
ample  to  illustrate  the  points.  Based  on  this  track¬ 
ing  scheme,  we  derive  an  adaptive  tracking  controller. 
Performance  of  the  tracking  scheme  is  illustrated  in 
an  example.  Although  the  responses  are  bounded,  the 
transients  are  very  large.  This  unacceptable  transient 
behaviour  is  closely  related  to  the  sensitivity  of  the  un¬ 
derlying  (non-adaptive  frozen  parameter  0)  certainty- 
equivalence  based  controller. 

The  other  three  approaches  are  all  implementable. 
The  state  availability  assumption  is  dropped  at  the 
expense  of  global  results.  Two  of  the  schemes  use  the 
state-estimate  in  two  different  coordinates.  The  third 
is  a  new  model-follower  scheme.  An  example  system 
is  used  throughout  to  illustrate  the  system  behaviour 
using  the  four  schemes. 

2  Plant  Description 

We  consider  a  class  of  single-input  single-output 
nonlinear  plants  which  can  be  exactly  input-output 
linearized  under  algebraic  state-feedback  and  change 
of  coordinates.  The  class  under  consideration  has  no 
zero  dynamics.  Specifically,  we  assume  the  following: 

1.  The  plant  dynamics  has  the  state-space  descrip¬ 
tion 

x  =  Ax  +  Bn  +  /0(x)  +  F\(x)0 
y  ~  cTx  '  ’ 

where  x  G  IRn  is  the  state,  u  G  1R  is  the  scalar 
control  input,  y  G  1R  is  the  scalar  sensed  output. 

2.  The  matrices  A  and  B  are  in  the  Brunowsky 
canonical  form.  cT  :=  [  1  0  0  . . .  0  ]  . 

3.  The  smooth  nonlinear  functions 

fo  :  nr  -  nr  and  F\  :  HI”  — *  IR"*'’ 
are  known  and  have  the  lower-triangular  forms 
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/<Xi)(x)  =  /o(o(*l  .  *3  ,  •  •  -  .  *i)  and  = 

Fi(ij)(*i ,  x2 ,  . .  .  ,  x.  )  for  «  =  ,  j  = 

1 . P 

4.  0  6  IRP  is  a  veclor  of  unknown  constant  parame¬ 
ters. 


2.1  Remarks 

For  the  above  plant  description,  there  exists  a  possi¬ 
bly  parameter  dependent,  coordinate  transformation, 
£  =  <&(x,0)  ,  where  $(  •  ,  0)  is  a  diffeomorphism  for  all 
0  .  Under  the  coordinate  transformation  4  ,  (1)  can 
be  equivalently  described  by 

£  =  A{  +  B[u  +  0o(x)  +  Gi(x)V>] 

y  =  cT(  w 


where  the  over-parametrization  ip  consists  of  entries  of 
appropriate  tensor  products  of  0  . 

The  class  of  plants  under  consideration  are  trans¬ 
formable  to  j =  u  +  yo(x)  +  Gi(x)ip  ;  hence,  they 
are  state-dependent  perturbations  of  a  string  of  n- 
integrators. 

For  notational  convenience,  we  will  refer  to  the  lin¬ 
ear  part  in  (2),  i.e.,  the  triple  ( 1,A,B )  as  the  linear 
plant  P  .  Let 

^(x)  :=  ffo(x)  +  G\(x)ip  (3) 

This  class  of  plants  satisfies  the  parametric  strict- 
feedback  condition  in  [Kan2]  ;  hence,  provided  that 
the  state  x  is  available  for  feedback,  one  can  obtain  a 
globally  adaptive  tracking  system  for  the  plant  in  (1) 
using  the  procedure  in  [Kan2]  . 

The  class  of  plants  under  consideration  include  the 
so-called  benchmark  example  ,  where  n  =  3,  p  =  1, 
/o(x)  as  0  and  Fj(x)  =  [/(xi)  0  0]T  ,  where  /(•)  is  not 
necessarily  Lipschitz  in  xx  . 

We  will  denote  “state-x  is  available  for  feedback” 
rather  than  “state  is  available  for  feedback”  ,  in  order 
to  emphasize  the  specific  coordinate  the  dynamics  is 
described  in.  Note  that  even  if  state-x  in  (1)  is  avail¬ 
able,  state-£  is  not  available,  since  it  depends  on  a 
0-dependent  transformation  where  0  is  unknown. 

Whenever  we  refer  to  a  stable  map,  the  map  is 
causal  and  bounded-input  bounded-output  stable  (de¬ 
fined  over  an  appropriate  extended  space).  With  a 
slight  abuse  of  notation,  a  stable  proper  transfer  func¬ 
tion  h(s)  will  be  denoted  as  A  in  the  input-output 
description  where  hG(x)  will  denote  the  time-domain 
waveform  obtained  bv  convolving  the  impulse  response 
corresponding  to  h(s )  and  the  signal  G(x)  .  Paranthe- 
ses  will  be  used  where  ambiguity  arises.  Unless  specif¬ 
ically  emphasized,  factorizations  denote  proper  stable 
factorizations. 

2.2  Example 

A  simple  third-order  example,  satisfying  the  condi¬ 
tions  in  Section  2,  is  used  to  illustrate  certain  points 
throughout  the  paper.  Consider 


X 1  =  X2  +  0/(Xj) 

*2  =  X3 

X3  =  U 

y  =  xi  . 


(4) 


Since  we  are  interested  in  an  input-output  approach, 
all  initial  conditions  in  the  simulations  are  assigned  as 
zero. 

Candidates  for  the  function  /(•)  that  we  have  used 
are: 

•  /(* i  )  —  —  tanh(oxj)  ,  motivated  by  a  simple  sat¬ 
urating  friction  nonlinearity.  Note  that  although 
/  is  Lipschitz,  the  corresponding  Gi  is  not. 


•  f(z\)  =  — x*  ,  following  the  benchmark  problem 
in  [Kan2]. 

•  /(*i)  =  -x\l(ax\  +  1) 

The  function  /  used  in  a  particular  simulation  will  be 
emphasized.  We  introduce  the  following,  for  future 
reference. 

p=l[i  ,  .>]r 


♦(*)= 


*1 

*j  +  0/(xi) 

x34tfx,/<1)(x1)  +  02/(xi)/(^(x1) 


f  -* 


h 

fe-VKi) 

b-ohPHti) 


i 


0o(x)  :=  0 


G,(x):= 


x\fW{z\)  +  Xa/t^xi) 

2  x,/(x,J/<*)(x»)  +  x2{^)(xl)y 

/(*i)(/'I,(*i))a  +  /2(*  i)/<2)(x,) 


iP  :=  [  0  02  0s  ]T 
Note  that  for  this  example, 


Gi(x)v>  =  (/<2>(f>)f22  +  /<1)(fl)fc)0  (5) 

=:  G  i(()®  i 

which  emphasizes  the  issue  of  overparametrization  in 
different  coordinates. 

Another  observation  which  becomes  useful  in  the 
perturbation  description  is  the  map  ($<f~J  —  7) ,  where 

$*-‘(0  =  $  6  -®/(6) 


=  /(()  +  (0-0) 


0 

mi) 

6/^tfi)  +  */(6)/(1)(6) 


(6) 

3  State-x  Available 
3.1  On  Sensitivity  of  State-x  Based  Sta¬ 
bilization 

The  contents  of  this  particular  subsection  focuses  on 
a  slight  generalization  of  the  class  in  Section  2  ,  where 
the  input  u  need  not  be  scalar. 

Consider  the  input  u  to  state-x  map  described  by  (1) 
or  equivalently,  by  (2) .  Under  a  change  of  coordinates 
($)  ,  the  input  u  to  state-£  map  can  be  rendered  linear 
under  an  algebraic  state-feedback  T  (see  (3))  .  Hence 
u  to  state-x  map  can  be  realized  as  in  the  dashed-box 
in  Figure  1  . 

The  map  from  v  to  (  is  linear  (time-invariant  fi¬ 
nite  dimensional).  Let  £  =  Pv  .  Since  P  admits 
coprime  factorizations,  P  =  ND~ 1  =  D~lN  with 

[  #  ~ij  ]  =  I  ,  where  all  eight  maps 


in  the  above  identity  are  stable. 

The  nonlinear  map  is  stable  and  its  inverse  is 
stable. 

The  algebraic  map  T  is  stable.  In  fact,  for  the  sub¬ 
sequent  derivations  in  this  subsection,  $  and  T  need 
not  be  algebraic. 

Note  that,  the  input  to  state-x  map  in  Figure  1  can 
be  expressed  as 
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Figure  1:  Stabilizing  scheme  based  on  the  estimates 
T  and  $ 


Vx  =  (7) 

=  (DQ-Nf)-lN  (8) 

where 

(U$  +  Vf)(*~lN)  +  {V)(D-f*-lN)  =  I  (9) 
(D<b-  Nr^Q-W)  +  {N)(U  +  P<Sf~1V)  =  /  (10) 

Note  'hat,  (9)-(10)  imply  that  (7)-(8)  are  in  fact 
nonlinear  right-  and  left-coprime  factorizations  of  Vx  , 
respectively.  The  equations  (9)  and  110)  also  describe 
a  stabilization  scheme  based  on  undoing  the  nonlin- 
eanties,  since  a  stabilizing  C  :  x  «-*■  u  is  given 
by 

C  =  -K*  -  ?  , 

where  ti  =  — /££  stabilizes  P  . 

In  order  to  incorporate  the  0  dependence  in  the  u  to 
x  map,  consider  <b  and  T  both  functions  of  0  whereas 
P  is  independent  of  0.  Note  that  such  an  assumption  is 
not  restrictive,  since  typically  the  choice  of  coordinates 
and  algebraic  state-feedback  constructed  to  render  P 
as  a  string  of  integrators  (in  each  channel).  From  now 
on,  P  and  its  associated  terms  in  the  Bezout-identity 
will  be  considered  as  ^-independent.  The  0  dependence 
in  $  and  T  will  be  emphasized  by  introducing  the  ~- 
versions  when  they’re  determined  by  the  parameter 
estimate  6  . 

We  now  can  express  the  standard  stabilization 
scheme  based  on  certainty-equivalence  approach  in 
terms  of  the  maps  introduced  so  far.  Since  0  is  not 
exactly  known,  the  control  law  from  x  to  u  is  realized 
as 

«  =  *(£-$(«))-£(*)  ,  (ii) 

where denotes  the  desired  reference  in  terms  of  the 

state-£  (see  Figure  1)  .  In  other  words,  when  0  =  0, 
the  tracking  performance  is  determined  by 

£  =  Pv 
®  =  A'(£r  -  £)  . 

Note  that  the  tracking  scheme  in  (11)  can  be  equiv¬ 
alently  represented  by  the  control  law, 

*  =  A'((r-£)  , 

where  A^is  a  £r-tracking  compensator  for  P  :  v  *-< >  £  , 
and  v  ,  £  are  as  shown  in  Figure  2  . 

Note  that  the  stable  feedback  perturbation  in  Fig¬ 
ure  2  can  be  expressed  as  {T  —  ,F)$_l  ,  since  x  = 
*-1(£)  is  available,  although  and  £  are  not.  As  long 


Figure  2:  Perturbation  of  P  :  v  *-»  £  under  0  mis¬ 
match 


as  the  feedback  law  from  £  to  u  is  chosen  as  a  lin¬ 
ear  time-invariant  K  stabilizing  P  =  ND~l  ,  one  ob¬ 
tains  the  precise  condition  under  which  the  certainty- 
equivalence  based  control  iaw  can  withstand  0  pertur¬ 
bations.  Before  stating  the  conditions,  we  will  adopt 
the  following  definition. 

•  Let  K  stabilize  P  (i.e.,  K  —  V~lU  and  P  = 

N D-i  ,  where  UN  +  VD  =  I)  .  K  is  said  to 
stabilize  the  perturbed  plant  in  Figure  2  iff  the 
control  law  u  =  ti2  +  K(ux  —  £)  yields  a  stable 
(ui ,  U2)  »-»  z  ,  where  z  denotes  the  pseudo-state 
in  Figure  2  (see  also  Figure  1)  . 

With  this  definition,  one  can  work  out  the  pseudo¬ 
state  equation  to  derive  the  necessary  and  sufficient 
condition  for  K  to  robustly  stabilize  the  perturbed  v 
to  £  map,  i.e., 


has  a  stable  inverse.  Note  that  P  can  be  stabilized 
using  only  its  first  output,  i.e.,  there  exist  U  =: 

(  Ui  0  0  . . .  0  ]  ,  such  that  UN  +  VD  =  /  .  Hence 
using  a  dynamic  feedback  compensator  of  the  form 
K  =:  (  C  0  0  ...  0  ]  ,  C  stabilizes  P  ;  moreover,  the 

map  in  (12)  further  simplifies  to  (  /+  V(T— Jr)$~1N  ) 
since  cT($$-1  —  I)  =  0  . 

Typical  robustness  results  (small-gain,  passivity 
based  sufficient  conditions)  can  be  utilized  together 
with  assumptions  on  the  nonlinear  post-multiplicative 
and  feedback  perturbations  in  (12)  (such  as  sector 
bounded  nonlinearities,  linear  or  multilinear  0  depen¬ 
dence  etc.)  to  generate  classes  of  systems  for  which 
(12)  can  be  justified.  However,  we  are  not  interested  in 
further  restricting  the  set  of  plants  under  investigation 
for  the  sake  of  forcing  some  sufficient  conditions.  For 
the  simple  example  in  Section  2.2  ,  even  if  the  nonlin¬ 
earity  /(•)  is  chosen  to  be  globally  Lipschitz,  the  same 
is  no  longer  true  for  T  . 

3.2  Sensitivity  to  e$  :  Simulation 
The  example  in  Section  2.2  with  /(*i)  =  —  x,  is 
used  for  the  following  simulation. 

Recall  that,  since  £t  =  x,  =  y  ,  from  (6)  , 

cT(<j>$~1  _  J)  —  0  .  Hence,  by  choosing  a  dynamic 
compensator  C  from  y  to  v  ,  the  error  introduced  by 
the  output  multiplicative  term  in  Figure  2  is  avoided. 

The  compensator  C  :  y  »-►  v  is  chosen  as  an  W2- 
optimal  compensator  for  1/s3  for  a  particular  choice 
of  weights.  Throughout  the  simulations,  C(s)  = 

«»+9.5J19js+35.3333j+66  6297  ■  ^ 

The  sensitivity  of  the  closed-loop  to  e#  .=  0  —  0  is 
illustrated  as  follows:  The  parameter  0  is  chosen  as 
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0=1.  For  a  unit-step  reference  input  the  parame¬ 
ter  estimate  0  is  perturbed  about  the  nominal  value 
1  .  When  6  =  0,  the  closed-loop  system  exhibits  the 

desired  tracking  response  of  a  linear  system.  When  0 
is  perturbed  to  1.0025  and  1.005  ,  the  tracking  perfor¬ 
mance  of  the  loop  is  shown  in  Figure  3  . 


dpz  =  u  + g0(z)  +  Gi{z)xp 
Nz  =  Z 
*x  =  Z 
v  =  £i  . 

where  N  and  dp  are  stable;  cTN  =:  np  and  ( np,dp ) 
is  a  right  coprime  factorization  of  l/s"  .  Since  np  is 
minimum-phase,  npd~x  is  strongly  stabilizable.  Hence 
there  exist  ne,de  with  d~}  stable,  satisfying  ncnp  + 
dcdp  =  1  . 

Since  N  is  strictly  proper,  Q  is  strictly  causal,  where 

G  =  Go  +  ($»(•))* 

fro*  =  So*  (15) 

Q\z  =  G\x  .  (16) 

Hence, 

dpz  =  u  +  Qz 


Figure  3:  Sensitivity  of  certainty-equivalence  based 
design  using  state-x  for  6  =  1  ,  0  =  1  ,  1.0025  ,  1.005  . 

Clearly,  within  0.5%  relative  error,  the  asymptotic 
tracking  property  is  lost  and  the  system  goes  unstable. 
An  adaptation  law  based  on  this  sensitive  closed-loop 
is  bound  to  exhibit  unsatisfactory  transient  behaviour 
even  if  asymptotic  tracking  is  achieved  by  parameter 
adaptation.  Note  that,  Figure  3  shows  the  closed-loop 

responses  for  frozen  6  values.  Hence,  one  should  defi¬ 
nitely  avoid  very  slow  adaptation. 

3.3  Adaptation 

For  the  plant  description  in  Section  2  ,  provided 
that  the  state-x  is  also  available  for  exact  linearization, 
one  can  bring  an  input-output  approach  to  a  particu¬ 
lar  case  of  certainty-equivalence  based  adaptive  control 
design. 

We  now  outline  the  design  procedure: 

1 .  For  the  class  of  plants  in  Section  2  ,  determine  go 
and  G\  ,  reducing  zero  columns  of  Gi  (if  any)  to 
cut  down  on  unnecessary  overparametrization  in 

v-. 


describes  a  well-posed  u  to  z  map.  Since  &  and  are 
stable  maps  and  ( np,dp )  is  a  right-coprime  pair,  the 
plant  can  be  equivalently  represented  by  the  following 
pseudo-state  equation 

dpz  =  u  +  G* 

npz  =  y  . 


Note  that  although  z  is  not  available,  f ?0z  and  G\z  are 
available  since  state-x  is  available  ((15),(16))  . 

Hence  the  plant  description  from  u  to  y  can  be  ex¬ 
pressed  as  ,  ,  „._i 

y  =  np {dp  —  Q)  . 

Now  apply  the  certainty  equivalence  based  control 


law 


ti  =  ti2  +  ^(ui  -  y)  -Go*  -  (Giz)rp 

“c 


where  ui  and  U2  denote  the  exogenous  additive  inputs 
at  the  plant  input  and  output  in  the  standard  unity- 
feedback  system,  respectively. 

For  U2  =  0  ,  Uj  =  r  ,  where  r  denotes  the  desired 
reference,  we  obtain 


2.  Design  a  stable  stabilizing  compensator  C(s)  for 
l/sn  ;  i.e.,  C  =:  nc/dc,  l/s"  =:  np/dp  and  npne  + 
dPde  =  1  . 

3.  An  adaptive  tracking  control  law  candidate  is  de¬ 
termined  by  applying  the  control 

u  =  C(r  -  y)  -  g0(x)  -  (Gix)tjj  ,  (13) 


y-npncr  =  dcnp((Giz)e^,)  . 

Since  denp  is  minimum-phase,  one  can  utilize  an  aug¬ 
mented  error  scheme  to  update  . 

In  order  to  cut  down  on  the  number  of  states  intro¬ 
duced  by  the  filters  in  the  augmented  error  scheme,  we 
use  the  filtered  error  ey  , 


where  the  parameter  estimate  ip  is  updated  using 
the  error  equation 

y-npncr  =  dcnp((G  ix)e^,)  ,  (14) 

so  that  y  — ♦  npncr  .  (e^  =  rp  -  tp  and  r  denotes 
a  reference  signal  that  tracks  a  step  input.) 

Note  that  (y  —  npner)  in  (14)  denotes  the  error  be¬ 
tween  the  plant  output  and  the  ideal  tracking  perfor¬ 
mance  (if  the  nonlinear  cancellations  were  exact)  . 

Consider  the  nonlinear  plant  described  in  Section  2 
shown  in  Figure  1.  For  the  scalar  input  case,  let  D  = 
dp  . 

The  pseudo-state  equations  can  be  written  as 


ey  :=  d^iy-n^r)  .  (17) 

Since  np  is  minimum-phase,  one  can  adapt  xp  using  the 
error  form 

ey  =  np({Qyz)e^)  .  (18) 

Since  (dcdp)( 0)  =  0  ,  when  cancellation  is  exact, 
step  inputs  can  be  asymptotically  tracked. 

We  now  derive  the  update  law,  based  on  the  error 
equation  in  (18)  .  Since  it  utilizes  the  standard  aug¬ 
mented  error  seneme,  we  briefly  outline  the  steps. 

Since  np  is  minimum-phase  and  strictly  proper,  it 
can  always  be  expressed  as  np  =:  /»i/»2  .  where  hi  is 
strictly  positive  real,  strictly  proper  and  is  proper 
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^stable  Note  that  hi  =  ,-^y  and  /i2  =  (s  +  l)np  will 
always  work,  since  np  is  strictly  proper.  However,  to 
cut  down  on  the  number  of  states  introduced  by  filtcr- 
■ng  by  h2  in  the  augmented  error  implementation,  it 
Bs  advisable  to  factor  out  a  strictly  positive  real  factor 
™bf  np,  instead. 

For  ease  of  notation,  let  W  :=  Qxz  .  Then  (18) 

tan  be  written  as  ey  =  /»2( )  .  After  adding, 

ubtracting  terms  and  using  the  fact  that  =  0  ,  we 
obtain 

V  =  , 

jpwhere  17  :=  ep  —  -  h2(Wtp))  .  Since  hi  is 

strictly  positive  real,  the  update  law 


1 

tit 

Ihe 

■th 


V>  =  -r{h2W)Tri  ,  7  >  0 


guarantees  that  i)  £  C2  fl  C ,»  and  e^,  €  C<x>  ■  Provided 
that  the  closed-loop  signals  are  bounded,  W  €  £»  I 

ience,  jj  — *  0  ,  rj>  — ►  0  and  ey  — »  0  .  This  concludes 
hat  set  point  tracking  is  achieved  in  the  limit  provided 
that  the  closed-loop  signals  remain  bounded. 

13.3.1  Adaptive  Tracking:  Simulation 

The  adaptation  scheme  above  is  applied  to  the 
example  in  Section  2.2  for  two  choices  of  /  .  In 
both  adaptive  schemes,  the  closed-loop  signals  remain 

■bounded;  hence  asymptotic  tracking  is  achieved.  How¬ 
ever,  the  transient  responses  are  unacceptable.  Note 
that  in  both  cases,  even  without  adaptation,  state-z 
based  certainty-equivalence  control  performance  was 
extremely  sensitive  to  constant  parameter  errors. 


I 

I 

I 

I 

I 

I 

I 


/(*  l)  =  +  1) 

reference  r  =  1  ,  0  =  1. 


a  =  0.05  ,  7  =  10  , 
This  function  approxi- 
The 


mates  f(x\)  =  —x\  reasonably  for  |xi|  <  2 
tracking  performance  is  shown  in  Figure  4. 

/(*  1)  =  -tanh(axi),  0  = -1,0  =  4, 7  =  1, 
reference  r  =  0.1  .  The  tracking  performance  is 
shown  in  Figure  5. 
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Figure  4:  Asymptotic  tracking  of  r  =  1  from  zero 
initial  conditions 

4  On  Implementable  Certainty- 
Equivalence  Based  Controllers 

Recall  that  the  example  in  Section  2.2  can  be  ex¬ 
pressed  in  x  or  (  coordinates  as  in  (1)  and  (2),  respec¬ 
tively.  Note  also  that,  due  to  (5),  there  is  no  need  for 
overparametrization  in  the  (  coordinates. 

If  the  parameter  vector  0  and  either  the  state  z  or 
€  are  known,  then  either  the  control 


u  =  v-  0Gi(O 


(19) 


o 
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Figure  5:  Asymptotic  tracking  of  r  =  0.1  from  zero 
initial  conditions 


or,  in  terms  of  the  state  x,  the  control 

u  =  v-Gi(x)ip  (20) 

provides  exact  input-output  linearization.  That  is,  the 
system  v  y  is  linear  time-invariant,  where 

t  =  At  +  Bv 
i/  =  6 

Let  C(s)  be  the  same  stabilizing  compensator  (for 
(cT  ,A,  B))  as  in  Section  3.2  .  Hence  v  =  C(r  -  y)  , 
where  r  denotes  the  desired  reference,  achieves  the  de¬ 
sired  tracking  performance  when  0  =  0. 

Since  the  parameter  vector  0  and  neither  state  x  nor 
{  are  known,  it  is  natural  to  replace  them  in  (19)  and 

(20)  with  estimates  0,  2,  and  i.  This  is  the  certainty 
equivalence  principle.  As  it  does  make  a  difference 
wnich  observer  state  is  used,  we  consider  them  sepa¬ 
rately.  As  in  Section  3.2  ,  the  following  simulations 
are  based  on  /(xj)  =  -xf  . 

4.1  Observer  in  the  State-x 

The  control  law  is 

x  =  (A  -  Lct)x  +  Bu  +  Fi(x)0  +  Ly 
u  =  v-Gi(x)j,  ,  1  > 

where  (A  -  Lcr)  is  strictly  Hurwitz.  Since  all  simula¬ 
tions  are  performed  with  zero  initial  conditions,  pro¬ 
vided  that  0  =  0,  the  nominal  performance  is  identical 
to  that  of  the  linear  equivalent. 

The  sensitivity  of  the  closed-loop  to  e#  :=  0  —  0  is 
illustrated  as  follows:  The  parameter  0  is  chosen  as 
0=1.  For  a  unit-step  reference  input  the  parame¬ 
ter  estimate  0  is  perturbed  about  the  nominal  value 
1  .  When  0  is  perturbed  to  1.1  and  1.2  ,  the  tracking 
performance  o!  the  loop  is  shown  in  Figure  6  . 

4.2  Observer  in  the  State-£ 

The  control  law  is 

t  =  (A  -  LcT)Z  +  B[u  +  6Gi(Z)}  +  Ly  (22) 

u  =  v-0G,(O  , 

where  ( A  —  LcT )  is  strictly  Hurwitz.  Since  all  simula¬ 
tions  are  performed  with  zero  initial  conditions,  pro¬ 
vided  that  0  =  0,  the  nominal  performance  is  identical 
to  that  of  the  linear  equivalent. 
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Figure  6:  Sensitivity  of  certainty-equivalence  based 
design  using  the  estimate  of  state-x  for  6  —  1  ,  6  ~ 
1  ,  1.1  ,  1.2. 


Figure  7:  Sensitivity  of  certainty-equivalence  based 
design  using  the  estimate  of  state-£  for  0  =  1  ,  0  = 
1  ,  1.25  ,  1.5  ,  1.75  ,  2  . 


For  a  unit-step  reference,  0  is  perturbed  1.25,  1.5, 
1.75  and  2  ;  the  tracking  performance  of  the  loop  is 
shown  in  Figure  7  . 

4.3  Model-Follower 

This  feedback-feedforward  control  scheme  is  based 
on  a  locally  stable  unity-feedback  system.  For  the 
given  nonlinear  plant  V  :  u  ►-+  y  ,  let  C-p{s)  locally 
stabilize  the  plant  in  the  unity-feedback  system  about 
the  reference  r  .  A  model  is  used  to  generate  the  feed¬ 
forward  signals.  The  model  is  a  copy  of  the  plant  de¬ 
scription,  where  6  is  replaced  by  0  .  Since  all  states 
are  available  for  this  model,  exact  linearization  can  be 
performed.  Let  C(s)  be  the  compensator  that  is  being 
used  so  far  in  the  previous  three  certainty-equivalence 
based  designs.  Subscript-m  denotes  the  model  vari¬ 
ables. 

The  control  law  is  (see  Figure  8)  : 


II 

> 

3 

+  Bvm 

J/m 

=  CT£m 

um 

=  C(r 

—  Jim) 

Um 

=  “ 

-  OGtiU) 

U 

=  tim 

+  Cv{ym  — 

For  the  specific  example,  C-p  =  C  achieved  local  stabi¬ 
lization  about  the  unit-step  reference.  6  is  perturbed 
to  1.25,  1.5,  1.75  ,  2  ,  2.5  ,  2  75  and  3  ;  the  tracking 
performance  of  the  loop  is  shown  in  Figure  9  .  Note 
that  this  scheme  exhibits  the  least  sensitive  tracking 
design. 


Figure  8:  Model-follower  described  by  (23),  where 
E{vm i £m)  -=  Vm  —  0Gi(£m)  • 


I 


Figure  9:  Sensitivity  of  the  model-follower  based 
certainty-equivalence  design  for  0  =  1,0  — 

1  ,  1.25  ,  1.5  ,  1.75  ,  2  ,  2.5  ,  2.75  ,  3  . 


5  Concluding  Remarks 

An  input-output  approach  has  yielded  some  insight 
into  robustness  of  feedback  linearizable  schemes  un¬ 
der  parameter  mismatch.  Our  simulations  of  adap¬ 
tive  nonlinear  systems  show  a  phenomenon  familiar 
in  adaptive  linear  systems.  Namely,  even  though  the 
system  is  theoretically  globally  stable,  the  transient 
can  be  quite  large.  The  problem  is  inherent  in  the 
parameter  sensitivity  of  the  system  when  the  adapta¬ 
tion  is  frozen.  The  observer-based  and  model-follower 
schemes  are  most  likely  more  robust  to  parameter 
changes  then  the  full-state  available  scheme,  which 
in  general,  is  not  implementable  unless  the  states  are 
measured. 
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Abstract 


An  input-output  approach  is  used  to  investigate  sta¬ 
bilisation  of  a  class  of  nonlinear  plants  that  can  be  ren¬ 
dered  linear  time-invariant  under  nonlinear  stable  dy¬ 
namic  feedback.  Results  are  obtained  for  the  standard 
unity-feedback  configuration  subject  to  bounded  dis¬ 
turbances.  It  is  shown  that  the  design  method  based 
on  applying  the  inverse  transformation  on  any  stabilis¬ 
ing  controller  for  the  transformed  linear  time-invariant 
plant,  does  not  necessarily  yield  a  stabilising  controller 
for  the  nonlinear  plant.  Conditions  under  which  the 
method  is  justified  are  derived. 

1  Introduction 


Due  to  the  relative  ease  of  solving  control  problems  as¬ 
sociated  with  finite-dimensional  linear  time-invariant 
(LTI)  plants,  the  class  of  nonlinear  plants  that  can 
be  rendered  LTI  by  algebraic  state-feedback  has  re¬ 
ceived  considerable  attention  in  the  literature  (e.g., 
pail,  SasI,  Kanl]  and  references  therein)  .  This  trans¬ 
formation  based  approach  provides  one  of  the  few  sys¬ 
tematic  means  of  d  signing  nonlinear  control  laws  and 
has  been  applied  to  numerous  research  problems  in¬ 
volving  stabilisation,  tracking,  adaptive  stabilisation 
and  adaptive  tracking.  While  most  of  these  applica¬ 
tions  might  be  valid  nominal  design  approaches,  the 
stability  of  resulting  closed-loop  systems  subject  to 
plant  uncertainties  and/or  disturbance  models  has  not 
received  comparable  attention. 

Towards  the  goal  of  establishing  robust  control  de¬ 
sign  methodologies  for  classes  of  nonlinear  plants,  a 
crucial  step  is  guaranteeing  stability  when  the  sys¬ 
tem  is  subject  to  persistently  exciting  bounded  distur¬ 
bances  ana  determining  bounds  on  such  disturbances 
for  which  stability  is  guaranteed.  In  other  words,  de¬ 
sign  methods  and  the  subsequent  analyses  must  be 
able  to  justify  stability  (be  it  local  or  global)  in  the 
presence  of  sensor  and/or  actuator  noise,  even  before 
bringing  in  more  demanding  robustness  to  plant  un¬ 
certainties. 

In  this  paper,  motivated  by  the  class  of  nonlin¬ 
ear  plants  that  can  be  rendered  LTI  under  nonlin¬ 
ear  algebraic  state-feedback,  we  focus  on  a  particular 
class  of  nonlinear  plants  that  can  be  rendered  LTI  un¬ 
der  nonlinear  tiable ,  possibly  dynamic,  feedback.  We 
study  the  stabilisation  of  this  particular  class  of  non¬ 
linear  plants.  The  study  is  based  on  an  input-output 
framework,  where  the  closed-loop  results  are  stated 
for  the  standard  unity-feedback  configuration  subject 
to  bounded  actuator  and  sensor  disturbances  [Desl] . 

The  results  are  organised  as  follows.  The  relation¬ 
ships  between  the  graphs  of  the  nonlinear  plant  and  its 
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transformed  LTI  counterpart  are  derived;  the  fact  that 
the  bounded  input-output  purs  of  the  two  maps  are  re¬ 
lated  by  stable  maps  motivates  the  generally  adopted 
design  procedure  based  on  undoing  the  nonuneanties, 
which  consists  of  the  following  steps: 

1.  transform  the  nonlinear  plant  to  an  LTI  one, 

2.  design  a  stabilising  controller  (possibly  nonlinear) 
for  the  LTI  plant  and 

3.  using  the  inverse  transformation  and  the  designed 
controller  for  the  LTI  plant,  determine  the  trans¬ 
formed  controller  for  the  nonlinear  plant. 

It  is  shown  that  this  straightforward  three  step  proce¬ 
dure  is  not  a  unified  stabmsmg  control  design  method 
for  the  class  of  plants  at  hand  when  the  plant  is  sub¬ 
ject  io  bounded  actuator /tent  or  disturbances .  In  other 
words,  given  any  member  of  the  particular  class  of 
plants  under  study,  taking  the  three  steps  mentioned 
above  does  not  necessarily  guarantee  a  stabilising  con¬ 
troller.  As  might  be  expected,  the  problem  arises  due 
to  the  second  step  of  the  design  procedure,  since  it  in¬ 
herently  assumes  that  the  design  for  the  transformed 
LTI  plant  and  the  nonlinear  plant  are  separate  prob¬ 
lems.  In  other  words,  any  stabilising  controller  for  the 
LTI  plant  may  not  yield  a  stabilising  controller  in  the 
third  step.  Subsequent  results  formalise  this  issue  and 
state  conditions  under  which  such  a  design  approach 
is  justified.  While  the  notion  of  incremental  stability, 
9uch  generalises  the  properties  of  linear  stable  maps, 
can  be  easily  imposed  to  guarantee  sufficient  condi¬ 
tions  for  the  method  to  work  in  general,  the  extreme 
conservatism  in  expecting  any  controller  in  the  second 
design  step  to  work  is  shown  by  establishing  that  incre¬ 
mental  stability  has  to  be  necessary  ,  as  well.  Unless 
such  restrictions  happen  to  be  satisfied  for  the  spe¬ 
cific  problem  at  hand,  all  of  these  cautionary  results 
imply  the  following:  The  design  problem  for  the  trans¬ 
formed  LTI  plant  need  not  be  decoupled  from  the  orig¬ 
inal  nonlinear  design  problem;  hence,  one  should  not 
rule  out  intentionally  nonlinear  controller  design  for 
the  transformed  LTI  plant.  Although  this  might  seem¬ 
ingly  defy  the  purpose  of  the  transformation,  since 
the  transformed  design  problem  is  yet  another  non¬ 
linear  control  design  problem,  it  also  emphasises  the 
need  for  more  results  in  nonlinear  control  design  for 
LTI  plants,  before  nonlinear  plants  are  transformed  to 
linear  ones.  Moreover,  despite  the  global  linearising 
transformation,  one  might  end  up  with  only  a  locally 
stable  interconnection.  Simple  examples  with  analyti¬ 
cal  derivations  are  used  to  emphasise  the  points. 

1.1  Notation  and  Preliminaries 

•  All  nonlinear  maps  in  this  study  are  causal,  multi¬ 
input  multi-output  and  defined  over  appropriate  prod¬ 
ucts  of  extended  L&,  (0,  oo)  spaces.  For  a  thorough 
treatment  of  general  extended  spaces  within  the  input- 
output  approach  to  ronlinear  systems,  see  e.g.,  (Deal] . 
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For  notational  convenience,  for  functions  from  JR+  to 
IRn  ,  the  associated  set  of  bounded  signals  and  the 
extended  space  will  ve  denoted  by  L"  and  L"  ,  re¬ 
spectively. 

While  most  of  the  observations  in  this  study  do 
generalize  to  other  extended  spaces,  the  particular 
choice  of  sup  bounded  functions  on  IR+  is  motivated  by 
studying  the  stability  properties  of  nonlinear  intercon¬ 
nections  subject  to  persistently  exciting  disturbances. 

An  n^-input  n„-output  causal  nonlinear  map  V  will 
be  considered  as 

V  ■  U  C  I-‘.  -  2."%  , 

where  U  denotes  the  domain.  The  extended  space 
is  a  means  of  incorporating  unbounded  signals  in  the 
study;  however,  one  must  note  that  although  L „  C 
Loot  i  (Loo)c  \  Loo*  ^  0  i  where  (-)c  denotes  the  com¬ 
plement  of  the  set  with  respect  to  the  set  of  all  func¬ 
tions  on  IR+  .  The  nonempty  intersection  arises  due 
to  discontinuities  which  are  not  jump-discontinuities. 
The  scope  of  the  extended  spaces  does  not  cover  signals 
which  exhibit  “finite  escape  time”:  hence  the  plant  de¬ 
scription  over  a  strictly  proper  subset  of  the  input  ex¬ 
tended  space  might  be  necessary.  Hence,  L%,  describes 
the  set  of  bounded  signals  and  Loot  \Loo  denotes  the 
set  of  unbounded  signals  (unbounded  at  infinity). 

•  Calligraphic  capital  letters  will  be  used  to  denote 
nonlinear  maps.  Italic  capital  letters  will  denote  lin¬ 
ear  time-invariant  maps  that  admit  finite-dimensional 
state-space  descriptions;  for  this  class,  with  a  slight 
abuse  of  notation,  the  map  and  its  associated  transfer 
function  representation  will  be  used  interchangeably. 
In  the  case  that  italic  letters  are  used  for  nonlinear 
algebraic  maps,  parentheses  will  be  included  to  em¬ 
phasise  the  evaluation  (i.e.,  Ax  vs.  F(x))  .  For  two 
nonlinear  causal  maps  T  and  G  the  map  JFQ  will  de¬ 
note  the  composition  of  the  two  maps. 

•  In  an  input-output  approach  to  the  analysis  and 
design  of  nonlinear  interconnections,  the  notions  of 
boundedness  and  stability  are  crucial  for  subsequent 
results.  Unlike  the  finite-dimensional  linear  time- 
invariant  case,  most  of  these  properties  depend  on  the 
particular  framework.  The  following  four  definitions 
set  up  the  particular  framework  in  this  paper.  For 
a  treatment  of  related  topics  in  fractional  representa¬ 
tions  of  nonlinear  casual  maps  and  stability  of  nonlin¬ 
ear  interconnections  see  e.g.,  [Haml,  Verl,  Vidl,  Kabl] 
and  references  therein. 

•  The  set  of  all  input  output  pairs  of  a  given  map 
does  admit  a  special  partitioning  which  exhibits  par¬ 
ticular  properties. 

Definition  1  (graph  of  V,  G(V)) 

For  a  given  map  V  :  U  C  L^t  -*  ,  the  graph 

of7>  is  denoted  by  G{V)  ,  where 

G{V)  =  {(u,7>u)  |  c  U  x  V(U)  .  o 

Note  that,  G{V)  admits  the  following  unique  parti¬ 
tioning 

G[P)  =  GbbIP)  u  gBU(v)  u  gUB(v)  u  gvu{v)  , 

where  the  subscripts  B  and  U  denote  the  bounded 
and  unbounded  components,  respectively.  The  com¬ 
ponents  are  given  by 

GbbCP)  =  Uu,Vu)  u6WniJj,Ptt€  L£j) 

Gbu\P)  =  flu,  Vu  tt£l/n  L2itV  u  e  L^t\L%} 
Ovb  V  =  ,{«,*>«!  u  eur\L%t\L%,Tu  e  l%} 

Guuiv)  =  {(u,T>u) 

€  LZAL: ;}  . 


In  the  bounded  and  unbounded  partitioning  of 
giP)  i  Gbb(V)  denotes  the  desired  i/o  pairs,  GBu(V) 
describes  the  instabilites  of  the  plant  (“unstable 
poles”)  and  Gub('P)  describes  the  instabilites  in  the 
inverse  relation  (“unstable  seros”)  .  Clearly,  a  feed¬ 
back  system  stabilises  V  subject  to  bounded  distur¬ 
bances  if  and  only  if  the  closed-loop  generates  plant 
i/o  pairs  in  GaB^P)  .  In  other  words,  GBB(V)  must 
be  nonempty.  For  example,  for  the  particular  map  V 
from  u  to  y  where  y  =  1  +  ua  ,  has  GBB(V)  =  t, 
hence  there  cannot  exist  any  stabilising  scheme. 

•  Unless  specifically  emphasised,  maps  are  not  con¬ 
sidered  to  be  stable  or  algebraic. 

Definition  2  (stable) 

A  causal  map  ft  :  L£t  -♦  L“-,  is  said  to  be  stable  iff 
there  exists  a  continuous  nonaecreasing  fa  :  JR+  — * 
1R+  such  that  ||Wu||  <  ta(IHI)  for  all  u  6  L% j  .  o 

For  a  stable  map  ft  ,  GBB{7i)  =  x  ft(I£)  ; 
Gbu{‘H)  =  •  ;  Gub(K)  may  or  may  not  be  empty. 
Definition  3  (unimodular) 

A  causal  map  ft  :  L* \  —  ££,  is  said  to  be  uni¬ 
modular  iff  ft  is  stable,  bijective  and  ft-1  is  stable, 
o 

For  a  unimodular  map  ft  ,  GBB(K)  =  XJj  x  L%  ; 

1  GUB^  ~  9  J  GuvVt>  ~  x 

Definition  4  (incrementally  stable) 

A  causal  map  7 i  :  is  said  to  be  in¬ 

crementally  stable  iff  ft  is  stable  and  there  exists  a 

continuous  nondecreasing  fa  :  IR+  — ♦  It,.  gUCh  that 

f°raU  “f  L~*  >  ||ft(v  +  v)-ftu||  <  £k(||v||)  for 
all  v  €  .  o 

Fot  an  incrementally  stable  map,  bounded  deviations 
in  the  input  result  m  bounded  deviations  at  the  out¬ 
put.  The  bound  on  the  output  deviation  is  indepen¬ 
dent  of  the  nominal  input  signal  u  . 

In  the  case  of  an  algebraic  incrementally  stable  map 

ft  where  the  bounding  function  fa  is  linear  ,  the  al¬ 
gebraic  nonlinearity  ft  is  also  referred  to  as  Lipschits 
continuous. 

•  From  now  on,  the  words  bounded  and  stable  will 
be  interpreted  in  the  sense  of  Loo  • 

•  The  nonlinear  unity-feedback  system  S(V,  C )  de¬ 
notes  the  interconnection 

y  =  Vu 

u  =  ux  -  C(u3  +  y)  , 

where  -si  and  tij  denote  the  exogenous  inputs  perturb¬ 
ing  the  actuator  and  sensor  signals  u  and  y  ,  respec¬ 
tively  (see  Figure  1) . 

•  The  feedback  system  S(V,  C)  is  said  to  be  stable 
iff  the  map  from  (ui,  tij)  to  (u,  y)  is  stable. 

Note  that  the  stability  of  S(V,C)  requires  that  the 
closed-loop  map  exists  and  it  is  stable.  The  well- 
posedness  condition  that  ensures  the  existence  of  the 
map  is  almost  always  satisfied  in  practice,  since  V 
and/or  C  are  strictly  causal. 

2  A  Class  of  Feedback  Lineariz- 
able  Plants 

Definition  5  (£) 

A  causal  nonlinear  plant  V  is  said  to  belong  to  the 
class  C.  iff  there  exist  a  unimodular  map  M,  a  linear 
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time-invariant  map  P  with  a  finite-dimensional  statc- 
•pace  representation,  and  a  stable  map  T  such  that 


2.  Cub(V)  =  TGub{P)  • 
Proof:  See  Appendix,  o 


V  =  M~lP(I-  TM~lP)~l  .  (1) 

In  order  to  emphasise  the  particular  triple  (M,P,T) 
used,  equation  (1)  will  be  denoted  by  V  = 
C(M,P,T).  o 


Note  that  if  V  =  C(M,P,T)  (see  Figure  1)  ,  the 
map  from  u  to  y  can  be  expressed  as 


V  : 


u  t— » 


Pv 

U  +  Ty 


Applying  the  stable  output-feedback 
u  =  Q-Ty 

the  map  from  V  to  My  is  rendered  linear.  Clearly,  for 
a  given  V  €  £  ,  the  triple  IM,  P,T)  is  not  uniquely 
determined.  The  well-poseaness  of  tne  feedback  loop 
in  the  description  of  V  is  inherently  assumed  by  the 
existence  of  the  map  from  u  to  y  ;  in  general,  strict 
causality  of  P  and/or  T  suffice  for  well-poaedness  of 
the  feedback  loop  in  C(M,  P,  T)  . 

This  particular  class  of  nonlinear  plants  in  Defini¬ 
tion  5  is  m  fact  motivated  by  the  special  class  of  nonlin¬ 
ear  plants  that  can  be  put  in  the  controllable  canonical 
form 


x  =  /(*)  +  flu 

C  =  *(*) 

c  =  AC  +  B(u  +  F{x))  , 

where  $  is  an  algebraic  change  of  coordinates  and  F  is 
an  algebraic  stable  map.  Note  that  for  this  particular 
case,  y  =  *  ,  My  =  *(y),  Ty  —  F(y)  and  P  =  (#7  — 
A)~lB  . 

For  plants  in  £  ,  the  linear  part  P  admits  coprime 
factorisations;  let 

P  =  ND~l  =  D~lN  (2) 

with  the  Besout  identity 


r  v 

V 

f  N 

-V  1 

[  ~D 

N  . 

l  D 

u  1 

where  all  of  the  eight  maps  are  stable.  By  linearity, 
it  ole  maps  are  also  incrementally  stable.  Note  also 
that  for  the  description  in  (2)  for  a  full  rank  P  ,  the 
numerators  drop  rank  at  the  seros  and  the  denomina¬ 
tors  drop  rank  at  the  poles.  Since  P  is  not  necessar¬ 
ily  single-input  single-output,  poles  and  seros  (except 
blocking  seros)  may  coincide.  Since  it  is  easier  to  de¬ 
scribe  the  stable  seros  and  unstable  poles  of  P  due 
to  the  transform  algebra,  one  can  easily  describe  the 
partitions  Gub(P)  and  Gbv{P)  •  The  following  re¬ 
sult  establishes  the  connection  between  the  graphs  of 
P  and  V  €  £  . 

Fact  6  (on  graphs  of  P  and  V) 

If  V  —  C(M  ,P,T)  ,  then  there  exists  a  unimodular 
map  T  such  that 

1.  Gbb{V)  =  TGBB(P)  , 


Note  that  Fact  6  establishes  that  the  desired  i/o 
pairs  and  the  unstable  dynamics  of  the  inverse  rela¬ 
tions  are  related  in  the  graphs  of  P  and  V  .  Such  a 
simple  description  between  the  rest  of  graph  partitions 
cannot  be  established  in  general. 

We  now  consider  the  stabilisation  of  plants  in  £  ; 

i.e.,  determining  a  causal  nonlinear  map  C  such  that 
the  feedback  system  S(V,C)  (see  Figure  1)  is  stable. 


Figure  1:  Feedback  system  S(V,C)  . 

The  existence  of  a  transformation  of  V  to  a  linear 
P  ,  and  Fact  6  suggests  stabilisation  schemes  based  on 
designing  controllers  for  P  .  The  following  conjectures 
emphasise  this  design  approach. 

Conjecture  7 

For  a  given  V  =  C(M,P,T) ,  \iS(P,CP)  is  stable  then 
S{V,  ( CPM  +  T))  is  stable,  o 

Conjecture  7  is  based  on  the  design  method  of  un¬ 
doing  the  nonlinear  maps  T  and  M  ,  which  consists 
of  the  following  steps: 

1.  transform  the  nonlinear  plant  to  an  LTI  one. 

2.  design  a  stabilising  controller  (possibly  nonlinear) 
for  the  LTI  plant  and 

3.  using  the  inverse  transformation  and  the  designed 
controller  for  the  LTI  plant,  determine  the  transformed 
controller  for  the  nonlinear  plant. 

Conjecture  8 

For  a  given  V  =  C(M,  P,T),  if  S(V,  C)  is  stable  then 
S(fl,  (C  -  T)M-*)  is  stable,  o 

If  Conjectures  7  and  8  were  in  fact  true,  one_  would 
end  up  with  a  necessary  and  sufficient  condition  for 
stabilising  plants  in  £  by  merely  using  the  transforma¬ 
tions  ana  nonlinear  controller  design  methods  based 
on  linear  plants.  We  now  investigate  these  conjec¬ 
tures  and  state  conditions  (other  than  trivial  cases  like 
V  =  P  =  £( I.  P,  0))  ,  under  which  related  results  can 
be  established. 

3  Stabilizing  Feedback 

Linear izable  Plants 

The  following  fact  (see  e.g.,  [Kabl])  describes  the  set  of 
all  nonlinear  stabilising  compensators  CP  in  5(P,  CP)  . 

Fact  9  (S{P)) 

Let  the  causal  linear  map  P  satisfy  equations  (2)  and 
(3)  .  Under  these  assumptions,  the  feedback  system 
5(P,Cj>)  is  stable  if  and  only  if  CP  €  «S(P)  ,  where 
S(P)  =  {( V  +  DQ)(V  -  NQ)~l  |  Q  is  stable  and 
(V  —  AT Q)is  bijective  with  a  causal  inverse}  o 
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The  following  Lemma  justifies  the  design  approach 
in  Conjecture  7  for  a  special  subset  of  plants  in  £  . 

Lemma  10  (A  Sufficient  Condition  for  Stabilis¬ 
ing  C(M,P,P)) 

Let  V  =  C(M,  P,P)  ,  where  the  maps  P  and  M  are 
both  incrementally  stable.  Under  these  assumptions, 
if  S(P,Cp)  is  stable  then  S(V,  ( CpM  +  P))  is  stable. 
Proof:  See  Appendix,  o 

The  existence  of  a  unimodular  map  that  relates  the 
desired  input  output  pairs  of  P  and  V  (see  Fact  6-1)  , 
is  the  mam  motivator  for  Conjectures  7  and  8  .  The 
second  part  of  Fact  6  relates  the  “unstable  sero  dy¬ 
namics”  of  P  and  V  .  For  a  precise  description  of  the 
notion  of  sero  dynamics  in  nonlinear  systems  that  ex¬ 
tend  the  well-known  case  in  finite-dimensional  linear 
time-invariant  maps,  see  [Isil]  .  Unlike  the  motivat¬ 
ing  controllable  canonical  form  derivation,  the  class  of 
nonlinear  plants  in  £  can  have  “unstable  sero  dynam¬ 
ics”  .  In  fact.  Lemma  10  does  apply  to  such  cases  since 
one  can  easily  stabilise  P  with  unstable  seros.  The 
advantage  of  the  class  £  is  to  take  the  stabilisation 
concept  of  algebraic-feedback  linearisable  systems  un¬ 
der  full-state  feedback  (which  relies  on  constructing  an 
output  that  has  the  sufficient  relative  degree  to  avoid 
“sero  dynamics")  to  a  larger  class  where  the  input  out¬ 
put  map  already  has  “unstable  sero  dynamics"  .  The 
virtue  of  Fact  6-2  is  that  one  does  not  need  to  deter¬ 
mine  the  stability  properties  of  the  "sero  dynamics" 
of  the  nonlinear  map  V  ,  which  is  a  highly  nontrivial 
task. 

We  now  present  a  counterexample  to  Conjecture  7  . 
Example  11 

Consider  the  single  state  plant  model  V  from  u  to  y 
described  by 

y  =  y2  +  u  . 

Note  that  V  —  £(/,  1/s,  (-)2)  •  Choose  Cp  —  I  € 
S(P)  .  For  C  =  CpM  +  P  =  I  +  (-)a  ,  the  dosedloop 
system  S(V,  C)  is  described  by 

y  =  -(y  +  ua)  +  y2  -  (y  +  «a)3  +  • 

Clearly,  for  ua  =  0  ,  the  closed-loop  map  from  ui  to 
(u,  y)  in  S(V,  C)  is  stable.  However,  for  ua  /  0  ,  the 
closed-loop  system  S( V,  C)  is  not  stable.  To  see  this, 

consider  ui  =  6i  and  uj  —  6 a  where  6i  =  0  ,  i  =  1, 2  . 
In  other  words,  under  constant  sensor  and  actuator 
bias,  the  closed-loop  system  is  described  by 

y  =  —(1  +  2£a)y  +  —  63  —  6$ 

Hence,  for  63  <  —0.5  ,  the  output  y  is  not  bounded,  o 


Example  1 1  emphasizes  the  importance  of  checking 
the  closed-loop  map  from  (ui.ua)  to  (u,y)  ,  rather 
than  just  ui  to  (u,  y)  .  The  same  example  also  illus¬ 
trates  the  effect  of  estimated  state  feedback  and  how 
the  lack  of  a  separation  principle  for  nonlinear  systems 
renders  the  design  procedure  heuristic,  which  happens 
to  fail  for  this  particular  case.  When  an  estimator  with 
steady-state  bounded  bias  is  introduced  in  the  ideal 
state-feedback,  the  closed-loop  is  no  longer  stable. 

The  problems  with  Conjectures  7  and  8  are  mainly 
due  to  ignoring  the  exogenous  input  uj  ,  and  relying 
on  the  following  fact. 


Fact  12  (Conjectures  7  and  8  under  no  mea¬ 
surement  noise) 

For  a  given  V  =  C(M,  P,  P)  ,  S(P,  Cp)  |Uj=0  is  stable 
if  and  only  if  S(V,CpM  +  P)  |u,=o  is  stable,  o 

The  instability  mechanism  in  Example  11  can  also 
be  used  to  establish  the  lack  of  performance  robust¬ 
ness  under  finite-energy  disturbances,  where  ui  and 
uj  — •  0  .  Given  any  bound  on  the  amplitude  of  y  , 
there  exists  a  finite-duration  disturbance  (hence  finite- 
energy)  such  that  the  closed-loop  output  exceeds  this 
bound  before  asymptotically  reaching  sero. 

We  now  state  the  main  result  that  shows  the  con¬ 
servatism  of  basing  the  stabilisation  of  V  on  the  stabi¬ 
lisation  of  P  .  The  following  theorem  establishes  that 
Conjecture  7  is  not  true  in  general  and  that  incremen¬ 
tal  stability  is  of  paramount  importance. 

Theorem  13  (Conservatism  of  Conjecture  7) 

Let  V  =  £(M,P,P)  with  M  incrementally  stable. 
Under  these  assumptions, 

S(V ,  (CpM  +  .F))  is  stable  for  all  CP  6  S(P)  ,  if  and 
only  if  P  is  incrementally  stable. 

Proof:  See  Appendix,  o 

Note  that  Theorem  13  emphasises  that  the  design 
procedure  is  no  longer  decoupled  (namely  first  stabi¬ 
lise  P  and  then  apply  the  transformed  controller  to 
V)  if  the  incremental  stability  of  P  fails.  When  incre¬ 
mental  stability  fails,  Theorem  13  does  not  imply  that 
there  does  not  exist  a  compensator  for  P  for  which 
Conjecture  7  is  valid.  On  the  contrary,  the  selection  of 
Cp  is  crucial  as  emphasized  in  the  following  example. 

Example  14 

Consider  the  plant  V  in  Example  11  .  Choose  Cj>  = 
I  +  (■)*  .  Note  that  Cp  €  «S(l/s)  (see  Appendix)  . 
Apply  the  heuristic  design  procedure  in  Conjecture  7  , 
i.e.,  moose  C  =  CpM  +  P  .  The  closed-loop  system 
S(V,  I  +  (*)3  +  (•)*)  is  stable  (see  Appendix)  .  o 

In  order  to  show  how  restrictive  it  is  to  impose  M 
and  P  are  both  incrementally  stable,  consider  the  fol¬ 
lowing  example. 

Example  15 

Consider  the  map  V  from  u  to  y  described  by 

*1  =  *a  +  /(*  1) 

*2  =  *3 

*3  =  u 

x(0)  =  0 
y  =  x  , 

where  the  algebraic  nonlinearity  f  is  bounded,  i.e., 
there  exists  ax  >  0  such  that  |/(x)|  <  k  for  all  x  6 1R . 
Clearly,  /(•)  ,  is  stable  and  incrementally  stable.  More¬ 
over,  for  any  linear  time-invariant  finite-dimensional  C 
such  that  S(l/s3[l  s  #J  ]T,  C)  is  stable,  the  intercon¬ 
nection  S(P,  C)  is  stable.  Now  consider  the  plant  V 
in  terms  of  the  canonical  form  £  .  Provided  that  / 
is  at  least  twice  differentiable,  under  the  coordinate 
transformation 


M  :  x  t-f 


*1 

*2  +  /(*  1) 

.  *3  +  +  /(xi)/*1* (*i) 


and  algebraic  feedback  Px  =  x\  f^\x i)+*3 
2*2/(*i)/(3,(*i)  +  *2(/W(*i))*  +  /(*  »)(/*l,(*i))a  + 

f  1  1 


/3(®i)/^3^(xi)  ,  one  has  V  =  C{M,  £ 


t 

.a 


Clearly,  M  is  unimodular  and  P  is  stable.  However, 
unless  /(*)  vanishes,  P  is  not  incrementally  stable,  o 
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4  Conclusion 

In  the  course  of  bounded-input  bounded-ouput  stabi¬ 
lisation  of  nonlinear  plants,  those  that  can  be  trans¬ 
formed  into  an  LTI  one  are  of  particular  interest  due  to 
a  vast  pool  of  theoretical  and  computational  tools  asso¬ 
ciated  with  LTI  design  problems.  The  existing  trans¬ 
formation  immediately  suggests  a  simple  design  ap¬ 
proach:  design  a  stabilising  control  law  for  the  trans¬ 
formed  LTI  plant  and  then  use  the  transformed  control 
law  for  the  nonlinear  plant.  As  it  is  shown  in  the  study, 
such  an  approach  is  not  a  sure  design  procedure  unless 
more  restrictions  are  imposed  on  the  transformation. 
The  design  for  the  transformed  LTI  plant  is  really  not 
decoupled  from  the  original  nonlinear  design  problem; 
hence,  one  should  not  rule  out  intentionally  nonlinear 
controller  design  for  the  transformed  LTI  plant.  Al¬ 
though  this  might  seemingly  defy  the  purpose  of  the 
transformation,  since  the  transformed  design  problem 
is  yet  another  nonlinear  control  design  problem,  it  also 
emphasises  the  need  for  more  results  in  nonlinear  con¬ 
trol  design  for  LTI  plants.  Along  this  line,  for  a  given 
LTI  plant,  one  has  a  complete  parametrisation  of  all 
nonlinear  stabilising  controllers  ;  however,  the  choice 
of  the  parameter  subject  to  a  particular  cost  is  not 
straightforward.  Another  approach  is  baaed  on  opti¬ 
mal  control  with  non-quadratic  integrand  cost  criteria 
from  which  nonlinear  state-feedback  laws  are  derived. 

5  Appendix 

Proof  of  Fact  6 

Let  V  =  £(M,  P,  T) .  From  Figure  1 ,  the  equations 
relating  the  (u,  y)  pair  and  (v,  Pv)  pair  define  the  map 
T  as  follows: 

T(v,  Pv)  =  (u,  y)  =  (v  -  TM~lPv,  M~lPv) 

(ti,Pv)  =  T~l{  u,y)  =  -(u+Ty,My)  . 

Since  T  is  stable  and  M  is  unimodular,  T  defined 
above  is  also  unimodular.  By  unimodularity  of  T  , 

(u,y )eQBB(P)  <=>  (v,  Pv)  e  Gbb(P)  , 

which  establishes  Fact  6  1). 

Let  (u,  y)  €  Gub(P)  •  Since  M  is  stable,  Pv  =  My 
is  bounded.  Since  u  is  unbounded  and  T  M~x  Pv  is 
bounded,  v  =  u  +  TM~xPv  is  unbounded.  Hence 
( v,Pv )  €  Gub[P)  •  Similarly,  v  unbounded,  Pv 
bounded  imply  y  bounded  and  u  +  Ty  unbounded, 
hence  u  is  unbounded.  This  establishes  part  2)  .  □ 
Proof  of  Lemma  10 

By  Fact  9  ,  the  stability  of  S(P,Cp)  implies  that 
Cp  =  (U  +  DQ)(V  —  N Q)~l  ,  for  some  stable  Q  .  Any 
input  output  pair  corresponding  to  the  map  Cp  can 
be  expressed  as  the  pair  ((V  —  N Q)(,  (V  +  DQ) 0  , 
where  {  denotes  the  pseudo-state  of  this  particular 
factorisation.  For  this  particular  factorisation,  the  in¬ 
put  output  pair  of  Cp  is  bounded  if  and  only  if  (  is 
bounded.  Moreover,  since  T,  M  and  M~l  are  stable, 
the  closed-loop  system  S(V,  (CpM  +  T))  is  stable  if 
and  only  if  the  closed-loop  map  from  (ui.u2)  to  £  is 
stable.  Writing  the  equations  describing  the  feedback 
system  S(V,  ( CpM  +  ?))  in  terms  of  (,  we  obtain  (see 
Figure  1  and  equation  (2)) 


Adding  DM(u3  +  y)  to  both  sides  of  (4),  substituting 

(5)  and  (3)  and  using  the  linearity  of  N  and  D  ,  we 
obtain 

(,  =  Nul-N(7r(y+u3)-Ty)  +  D(M(y+v3)-My)  . 

Clearly,  if  T  and  M  are  incrementally  stable,  (  is 
bounded  for  all  bounded  ui  and  u2  .  O 
Proof  of  Theorem  1 

“if”  Follows  by  Lemma  10. 

“only  if"  Follows  by  Example  11  ,  by  setting  M  = 
I,P=  1/s,  T  =  (•)*  and  CP  =  1  .  O 
Proof  of  the  claim  in  Example  14 

The  dosed  loop  system  S(l/s,  I  +  (•)*)  is  described 

by 

x  =  Ui  -  (x  +  u3)  -  (x  -f  Uj)3  . 

The  dosed-loop  system  S(P,  J+(-)J+(-)3)  is  described 
by 

Z  =  «!  +  **  -  (*  +  Uj)  —  (x  +  Uj)a  -  (X  +  U2)S  . 

For  ui  and  u2  in  IM  ,  using  the  Lyapunov  function 
candidate  V(x)  =  x3  ,  in  both  cases  for  |x|  sufficiently 
large  V  along  the  solution  trajectories  is  eventually 
negative.  Hence  the  dosed-loop  map  from  (u2,u2)  to 
(u,  x)  is  stable  in  both  cases. 
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Appendix  C 


Appendix  C  includes  a  regular  invited  paper  presented  at  the  1993  Automatic  Control  Con¬ 
ference. 

“Real-Time  Implementation  Issues  in  Nonlinear  Model  Inversion,” 

M.  Giintekin  Kabuli,  Sudarshan  P.  Bhat  and  Robert  L.  Kosut, 

Proceedings  of  the  American  Control  Conference , 
pp.  547-551,  San  Francisco,  June  1993. 
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Real-Time  Implementation  Issues  in  Nonlinear  Model 

Inversion 


M.  Guntekin  Kabuli  Sudarshan  P.  Bhat  Robert  L.  Y.~:  ut  * 
Integrated  Systems  Inc.,  3260  Jay  Street,  Santa  Clara,  CA  95054-3309 


Abstract 

A  real-time  control  problem  associated  with  a  flex¬ 
ible  testbed  fixture  is  considered.  The  regulated  vari¬ 
able  is  the  deflection  at  the  tip  of  a  flexible  beam  at¬ 
tached  to  an  inertia  wheel  base  which  is  subject  to 
a  disturbance  torque.  The  control  torque  affects  the 
base  wheel  through  a  coupling  which  has  both  com¬ 
pliance  and  variable  backlash.  The  performance  spec¬ 
ifications  include  disturbance  attenuation  and  rapid 
slewing  at  the  tip.  The  designs  are  based  on  an  analyt¬ 
ically  derived  continuous-time  model  which  is  tuned  in 
accordance  with  the  measured  data.  The  model  con¬ 
sists  of  an  interconnection  of  a  linear  time-invariant 
part  and  piecewise-linear  algebraic  nonlinearities  mod¬ 
eling  friction  and  backlash.  The  candidate  designs 
must  be  based  on  available  measurements  only;  more¬ 
over,  the  final  discrete-time  control  law  must  be  exe¬ 
cutable  with  the  real-time  controller  hardware  limits 
tions  at  hand. 

Introduction 

The  first  design  approach  is  based  on  the  linear 
model  where  the  nonlinearities  due  to  friction  and 
backlash  are  set  to  zero.  A  single  tone  additive  actua¬ 
tion  disturbance  is  attenuated  at  the  regulated  output 
by  incorporating  the  disturbance  model  in  the  feed¬ 
back  loop.  Designs  are  based  on  a  discretized  plant 
model  and  implemented  on  an  Integrated  Systems  AC- 
100  real-time  control  processor. 

The  second  approach  is  a  case  study  based  on 
feedback  linearization  where  a  single  parametric  non¬ 
linearity  is  included  in  the  design  model.  Due  to 
the  nonminimum-phase  zero  dynamics  characteristics 
from  the  actuation  to  the  tip  of  the  flexible  beam,  the 
study  focuses  on  the  base  wheel  position  and  slewing 
performance.  We  study  the  sensitivity  of  two  certainty 
equivalence  based  control  laws  for  the  frozen  parame¬ 
ter  case.  Based  on  the  least  sensitive  non-adaptive  de¬ 
sign,  an  adaptive  slewing  controller  is  designed.  This 
approach  for  slewing  design  is  later  modified  to  obtain 
a  baseline  design  for  attenuating  a  measured  multi- 
tone  disturbance.  The  disturbance  model  is  based  on  a 
periodic  torque  waveform  affecting  the  wheel  base  after 
the  coupling  driven  by  the  actuator  motor.  The  result¬ 
ing  continuous  time  control  law  was  simulated  to  verify 
the  performance  enhancement  due  to  the  signal  gen¬ 
erator  in  the  proposed  feedforward/feedback  scheme. 
One  of  the  major  tasks  for  real-time  implemen labil¬ 
ity  was  to  obtain  a  discrete-time  approximation  to  the 
continuous-time  simulations  of  the  nonlinear  control 
law  subject  to  the  sampling  rate  and  computational 
K  ^  'imitations  imposed  by  the  real-time  controller 
hardware.  The  approximate  discretization  was  down- 
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loaded  to  the  real-time  controller  processor  and  the 
continuous-time  versus  discrete-time  simulation  com¬ 
parison  proved  to  be  satisfactory.  The  disturbance 
rejection  part  is  a  baseline  design  which  is  currently 
unimplementable  unless  the  disturbance  can  be  mea¬ 
sured;  however,  the  current  discrete-time  control  law 
is  an  implementable  candidate  for  real-time  slewing 
tests. 

Design  Approach  I 

An  adaptive  controller  was  designed  for  rejecting 
single  tone  sinusoidal  disturbance  in  a  prespecified  fre¬ 
quency  range.  The  controller  consists  of  an  inner- 
loop,  robustly  stabilising  component  and  an  outer- 
loop,  adaptive  error  rejection  component.  The  control 
design  was  accomplished  in  three  steps  (see  Figure  1)  . 


sinusoid 

disturbance 


Figure  1: 

The  first  step  was  the  design  of  the  inner  loop  con¬ 
troller.  A  10  hz  low  pass  Alter  was  included  in  the  loop 
to  avoid  any  destabilizing  loopgain  in  excess  of  10  hz. 
This  underscores  the  objective  of  actively  controlling 
the  rigid  body  slew  mode  and  the  flrst  flexible  mode 
(around  4.7  hz  frequency  range).  This  will  also  result 
in  the  flexible  modes  in  excess  of  10  hz  to  be  unregu¬ 
lated.  The  stabilizing  ‘LQG/LTR  controller  I*  in  the 
innerloop  was  designed  using  the  LQG/LTR  design  ap¬ 
proach  based  on  the  input/output  relation  from  ‘1*  to 
‘2’  (see  Figure  1)  in  the  inner-loop  in  the  absence  of 
the  outer  loop;  note  that  this  requires  an  LQG/LTR 
design  for  an  augmented  plant  which  includes  the  ac¬ 
tual  plant  cascaded  by  the  10  hz  low  pass  Alter  at  the 
input. 
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The  second  step  was  the  design  of  the  outer  loop 
controller  for  rejection  of  a  nominal  single  tone  sinu¬ 
soid  disturbance  frequency  of  4  h z  on  plant.  The  outer 
error  rejection  loop  contains  a  frequency  shaping  block 
that  contains  an  undamped  pole  at  4  hz.  This  un¬ 
damped  pole  provides  an  infinite  gain  at  4  hz  which  in 
a  stabilizing  loop  assures  perfect  rejection  of  a  single 
tone,  4  hz,  sinusoid  disturbance  on  the  plant.  A  20 
dB  attenuation  on  both  directions  from  the  4  hz  fre¬ 
quency  assures  the  retaining  of  inner-loop  characteris¬ 
tics  outside  the  4  hz  range.  The  stabilizing  ‘LQG/LTR 
controller  II’  in  the  outerloop  was  designed  using  the 
LQG/LTR  design  approach,  just  like  in  the  inner-loop, 
based  on  the  input/output  relation  from  ‘3’  to  ‘4’  (see 
Figure  1)  in  the  presence  of  the  inner-loop.  Figure  2 
shows  the  LQG/LTR  loop  gain  through  the  plant  with 
the  two  stages  of  control  (solid  line)  and  with  only  the 
inner  stage  of  control  (dotted  line).  The  corresponding 
sensitivities  are  shown  in  Figure  3  . 


Figure  2: 


Figure  3: 

The  third  step  was  the  adaptive  relocation  of  the 
undamped  poic  in  the  outer-loop  shaping  filter  to  the 


estimated  frequency  of  disturbance.  When  the  esti¬ 
mated  frequency  is  equal  to  the  actual  frequency  of 
a  sinusoid  disturbance  and  the  resulting  outer-loop  is 
stable,  a  perfect  disturbance  rejection  is  guaranteed. 
The  frequency  of  disturbance  was  estimated  using  an 
extended  Kalman  filter  based  on  a  notch  filter.  It  is 
emphasized  again  that  the  only  change  from  the  two 
stage  controller  designed  above  is  the  adaptive  tuning 
of  the  shaping  filter  in  the  outer  loop.  The  entire  inner 
loop  and  the  *  LQG/LTR  controller  II1  in  the  outer  loop 
remain  unchanged.  Such  an  adaptive  tuning  was  facil¬ 
itated  by  the  fact  that  the  two  stage  loop  was  found 
analytically  stable  for  the  filter  peak  placed  anywhere 
in  the  range  2  to  7.  hr. 

Figures  4  and  5  are  plots  generated-  from  the  data 
captured  during  a  real-time  implementation  of  the 
adaptive  controller  on  the  testbed  fixture.  A  step 
response  of  the  adaptive  controller  is  shown  in  Fig¬ 
ure  4  .  The  top  seven  strip  charts  plot  the  sensor  mea¬ 
surement  data  from  motor  encoder,  backlash  encoder, 
i wheel  encoder,  strain  gage  I,  strain  gage  2,  motor  ve¬ 
locity  and  tip  accelerometer  respectively.  The  bottom 
strip  chart  refers  to  the  actuation  voltage  data  to  the 
motor.  Figure  5  shows  the  same  sensor  measurements 
(top  seven  strip  charts)  for  a  steady  0.6  V  sinusoid 
disturbance  at  the  actuator  with  the  disturbance  fre¬ 
quency  undergoing  step  changes  (in  hs)  from  from  4 
to  3.75,  4  and  4.25  (8th  strip).  The  9th  strip  shows 
the  real  time  estimate  of  the  disturbance  frequency. 
The  strip  data  shows  a  disturbance  buildup  immedi¬ 
ately  following  the  step  change  in  the  disturbance  fre¬ 
quency.  This  is  followed  by  disturbance  attenuation 
which  occurs  as  the  frequency  of  disturbance  estimator 
locks  on  to  the  true  frequency  of  the  sinusoid  distur¬ 
bance.  Adaptation  and  complete  disturbance  rejection 
is  accomplished  within  5  to  10  seconds  following  a  step 
change  in' the  disturbance  frequency. 
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Figure  4: 

The  adaptive  controller  presented  here  was  not  able 
to  handle  disturbance  rejection  on  the  testbed  fixture 
in  the  range  4.6  to  4.8  hz  (also  confirmed  from  analyt¬ 
ical  simulations).  This  behavior  can  be  attributed  to 
the  presence  of  a  lightly  damped  flexible  mode  around 
4.7  hz.  The  adaptation  mechanism  could  lock  on  the 
disturbance  frequency  only  if  the  sensor  measurements 
were  dominated  by  the  single  tone  of  the  disturbance- 
With  the  disturbance  frequency  in  the  4.6  to  4.8  b* 
range,  the  sensor  measurements  also  contained  a  sig¬ 
nificant  contribution  from  the  lightly  damped  mode. 
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Figure  5: 
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This  situation  resulted  in  the  frequency  estimator  flip- 
flop-ing  between  the  two  dominant  frequencies,  result¬ 
ing  in  ringing. 

Design  Approach  2 

Consider  the  nonlinear  plant  model  shown  in  Fig¬ 
ure  6  ,  where  G  denotes  the  known  linear  part  and  T 
denotes  the  algebraic  nonlinearity  (e.g.,  friction,  satu¬ 
ration)  which  depends  on  an  unknown  parameter  9  . 
The  signals  t u  and  v„  at  the  input  and  output  of  the 
nonlinearity  are  unavailable  as  measurements.  The 
disturbances  and  regulated  variables  are  denoted  by  to 
and  z  ,  respectively.  The  goal  is  to  determine  a  con¬ 
trol  input  u  such  that  using  only  the  measurement  at 
y,  the  output  y  asymptotically  tracks  a  desired  refer¬ 
ence  signal. 


Figure  6: 


The  adaptive  scheme  we  are  investigating  is  based 
on  a  feedforward-feedback  configuration  (see  Figure  7). 
This  configuration  gives  rise  to  two  nontrivial  sub- 
problems:  1)  Sensitivity  of  the  non-adaptive  design, 
i.e.,  the  stability  and  tracking  robustness  of  the  closed- 

loop  subject  to  perturbations  of  0  about  the  nominal 
9  and  2)  Parameter  estimation.  Without  closing  the 
loop  via  the  parameter  estimator  in  Figure  7,  we  first 

design  a  tracking  controller  based  on  a  known  signal  0. 
The  motivation  behind  the  sensitivity  study  is  that  for 
sufficiently  slow  adaptation,  one  recovers  the  perfor¬ 
mance  of  the  inherent  tracking  design.  Hence,  the  goal 
is  to  determine  the  least-sensitive  non-adaptive  design. 
So  far,  our  studies  have  shown  that  among  other  track¬ 
ing  designs  based  on  the  certainty-equivalence  princi¬ 
ple  (use  the  parameter  estimate  as  if  it  were  correct), 
ine  underlying  design  in  Figure  7  is  consistently  the 

<'-ast  sensitive.  For  a  known  parameter  estimate  9  , 
a  candidate  input-output  pair  (u,y)  is  generated  us- 

£he  model  in  Figure  6  with  9  replaced  by  9  .  The 
•  racking  controller  Ct  achieves  the  desired  tracking  of 

."]?*  r*^erence  signal  r  in  the  loop  with  the  model  V  . 
_  *  input-output  pair  (u,  y)  is  injected  to  a  feedback 
V-siem  where  Cp  stabilizes  the  actual  plant;  Cp  alone 


need  not  be  a  tracking  compensator.  This  approach 
improves  the  “performance  bandwidth”  without  in¬ 
creasing  the  “control  bandwidth”  of  the  feedback  sys¬ 
tem.  Hence  it  has  the  potential  of  improving  existing 
controller.  p.erXw.mMSf.  a?  i  t.b.<wi.t  a  .tatal .  r.cdes.i&n 
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Figure  7: 


The  plant  model  is  of  the  form 
*  =  Ax  +  buu  +  6„(to  —  9f(y)) 
y  =  CyZ 

where  the  10  state,  relative  degree  3  SISO  system 
(cy, A, bu)  is  minimum-phase  with  a  lightly  damped 
sero  at  3.25  Hz  .  The  algebraic  nonlinearity  /(•)  = 
sati(40-)  ,  where  satj  is  the  odd  symmetric  piecewise- 
linear  saturation  function  with  unity  slope  and  ±1 
saturation  limits.  In  the  rest  of  this  study  /(-)  = 
tanh(40-)  will  denote  the  smooth  approximation  to  /  . 
The  periodic  disturbance  torque  is  denoted  by  w  .  Us¬ 
ing  y,  y,  yW  ,  the  control  u  determined  by 
fau  =  «|  -  a3x  -  A(«»  -  6f( y)) 

+0i(0f(y)+0f{lW,)-w)  , 
for  appropriate  coefficients  a  and  0  ,  renders  t/3^  =  uj . 
Let  £  denote  the  associated  map  (i.e.,  from  x,  w,  to,  0, 
f  to  Uf)  .  Note  that  £  can  only  be  realized  in  the  ideal 
case  wnere  x,  to,  t b  ,  9  and  /  are  all  known.  Although 
such  is  not  the  case,  it  still  is  interesting  to  note  that 
the  steady-state  actuation  demand  to  decouple  y  from 
w  is  given  by  ^(ftw  +  0i^>)  ■  For  the  particular  pe¬ 
riodic  w  and  actuation  limits  at  hand,  this  translates 
into  disturbance  rejection  of  up  to  7  harmonics  of  to  . 

Similarly  let  £  denote  the  map  depending  on  the  avail¬ 
able  signals  x,  to  ,  to,  6  and  the  smooth  approximation 
/• 


Figure  8: 


Sensitivity  of  the  nominal  design 

Consider  the  interconnection  in  Figure  8  ,  and  a  10 
mrad  slew  with  the  desired  tracking  performance  for 

£  =  £  as  shown  in  Figure  9  (lower  curves)  .  The  solid 
and  dashed  lines  denote  the  base  and  tip  positions, 
respectively.  The  periodic  disturbances  w  and  to  are 
exactly  canceled.  Note  that  the  feedback  lineariza¬ 
tion  results  in  the  inversion  which  causes  the  lightly 
damped  3.25  Hz  mode.  In  the  upper  curves  the  mis¬ 
match  is  only  f  vs  /  ;  i.e,  r,  to,  to,  0  are  exactly  used  in 
the  feedback  linearizer  except  that  the  algebraic  non- 
linearities  are  not  identical.  Performance  degradation 
is  self-explanatory. 


Due  to  the  sensitivity  of  the  one-loop  implementa¬ 
tion  of  the  certainty-equivalence  principle  in  Figure  8  , 
we  resorted  to  the  certainty-equivalence  based  control 
scheme  as  shown  in  Figure  10  (redrawn  from  Figure  7 
for  ease  of  comparison  with  Figure  8)  .  A  simple  con¬ 
troller  C  =  10  stabilized  the  inner  loop.  The  perfor¬ 
mance,  on  the  other  hand,  depends  on  the  generated 
signals  (u,  y)  .  Our  design  studies  have  shown  that  the 
model-follower  scheme  in  Figure  10  consistently  proves 
to  be  the  least  sensitive  to  mismatches.  These  obser¬ 
vations  are  mainly  due  to  the  fact  that  the  inner-loop 
is  inherently  stable  and  the  performance  depends  on 
the  cascade  structure.  Howevex,  in  Figure  8  ,  both  sta¬ 
bility  and  tracking  performance  have  to  be  achieved  in 
one  loop. 


Figure  10: 

Adaptive  Slewing  with  w  =  0 

A  parameter  estimation  block  is  introduced  in  the 
certainty-equivalence  based  tracking  control  design 
scheme  in  Figure  10  ,  to  complete  the  interconnec¬ 
tion  as  shown  in  Figure  7  .  The  disturbance  w  and 
the  initial  conditions  z(0)  =  z(0)  are  assumed  to  be 

zero  ;  however,  8  ^  6  and  /  ^  /  .  Moreover,  the 
state  x  is  not  being  used  in  the  control  law  and  the 
parameter  estimator  solely  depends  on  available  sig¬ 
nals  u  and  y  .  In  other  words,  for  the  disturbance  free 
case,  the  interconnections  in  Figure  10  and  Figure  7 
are  candidates  for  implemcniable  designs  since  they 
depend  on  available  signals  only.  However,  real-time 
implementation  requires  the  extra  step  of  control  law 
discretization,  which  proves  to  be  a  nontrivial  task  as 
it  will  be  emphasized  later  on. 

Consider  Figure  11  .  The  dashed  line  denotes  the 
the  signal  generator  output  trajectory  (i.e.,  y)  and  the 
solid  line  denotes  the  basewheel  yaw  (i.e.,  y)  .  The  ref¬ 
erence  signal  r  is  a  smooth  ±20  mrad  periodic  wave¬ 
form  which  y  tracks  after  4  seconds.  Note  that  the 
output  y  tracks  r  within  1  mrad  after  approximately  6 
seconds.  There  are  two  important  points  to  be  made 
about  these  responses:  I)  the  signal  generator  depends 


on  8  which  is  not  constant  (  depends  on  the  parame¬ 
ter  estimator  output  waveform  ).  Moreover,  unlike  the 
case  in  Figure  10  ,  when  the  parameter  estimator  loop 
is  closed,  8  is  no  longer  an  exogenous  signal  for  the 
signal  generator,  hence  for  slow  adaptation  (over  the 
6  second  window),  y  also  exhibits  transient  response 
(see  dashed  lines  over  the  first  4  seconds  in  Figure  11)  . 

When  6  is  an  exogenous  signal,  the  signal  generator  is 
designed  so  that  y  tracks  r  in  less  than  a  second.  2) 

due  to  /  and  /  mismatch,  even  with  6  =  8,  the  ideal 
tracking  performance  in  the  lower  solid  line  in  Figure  8 
degrades  to  1  mrad  off  at  2  seconds  ;  hence  the  offset 
emphasized  by  parallel  lines  in  Figure  H  .  The  param¬ 
eter  estimator  output  (i.e.,  8)  is  shown  in  Figure  12  . 


Figure  11: 


Figure  12: 

Baseline  disturbance  rejection  design  and  dis¬ 
cretization  issues 

In  order  to  illustrate  the  approach,  consider  the  case 
where  the  interconnection  in  Figure  10  is  LTI  and  mod¬ 
ified  to  incorporate  the  disturbance  w  ;  the  disturbance 
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estimate  used  in  the  signal  generator  will  be  denoted 
by  w  ■  The  LTI  plant  from  (ui,  u)  to  y  is  denoted  by 
(/V  .  Pv«)  ant*  C  denotes  an  LTI  stabilizing  compen¬ 
sator.  Let 

y  =  Pyu(u  +  C(y  -  y))  +  Pvww 
y  =  PyuW  +  PyupUl  , 

where  y  denotes  the  desired  plant  output.  Let  ev  and 
ew  denote  the  errors  (y-y)  and  (uj  -ii)  ,  respectively. 
Then 

ty  =  (/  +  PyUC)~l  PyWew  . 

Clearly,  there  are  two  options:  1)  relying  on  the  poles 
of  PyuC  at  the  disturbance  poles  or  2)  choosing  C  to 
be  any  stabilizing  controller  and  trying  to  set  ew  — •  0  . 
In  the  following  baseline  design,  we  will  make  the  as¬ 
sumption  that  the  disturbance  w  is  periodic  with  a 
known  period  and  can  be  measured  .  Even  with  these 
stringent  assumptions  deriving  a  discrete-time  imple¬ 
mentation  of  the  feedback  linearization  based  model- 
follower  scheme  turned  out  to  be  a  nontrivial  task. 
In  fact,  even  if  to  is  assumed  to  be  measured,  the 
disturbance  does  not  come  in  additive  at  the  plant 
input;  canceling  the  effect  of  to  at  y  requires  higher 
order  derivatives  of  to  .  Since  the  performance  de¬ 
pends  on  the  success  of  cancellation,  the  derivatives 
are  not  constructed  by  filtered/delayed  versions  of  to  ; 
instead,  an  identifier  is  built  to  determine  the  Fourier 
coefficients  of  the  harmonics  of  interest  and  the  sig¬ 
nal  generator  outputs  (ti,  y)  are  derived  accordingly. 
Throughout  this  process,  the  inner-loop  controller  is 
chosen  as  a  simple  gain  C  =  10  ;  the  inner-loop  is  in¬ 
herently  stable,  and  the  to  identifier  generates  the  to 
information  used  in  the  signal  generator.  Through¬ 
out  the  following,  there  is  a  nonlinearity  mismatch  ; 
the  plant  model  in  the  inner-loop  uses  /  ,  whereas 

the  signal  generator  uses  the  smooth  version  /  .  The 
parameters  satisfy  6  —  6  —  0.3  .  A  continuous- 
time  design  was  completed  ;  the  next  step  was  to  re¬ 
cover  the  continuous-time  simulation  results  using  a 
discrete-time  controller.  A  maximum  sampling  rate 
of  400  Hz  is  dictated  by  hardware  limitations.  The 
minimum  step-size  of  2.5  ms  was  not  small  enough 
to  reasonable  replicate  the  nonlinear  ordinary  differ¬ 
ential  equation  solution  with  any  fixed  step-size  inte¬ 
gration  algorithm.  The  final  successful  discretization 
was  based  on  the  4th  order  Runge-Kutta  algorithm, 
which  requires  intersample  point  evaluation.  In  order 
to  avoid  oversampling,  the  controller  was  decomposed 
into  an  LTI  system  in  feedback  with  algebraic  nonlin¬ 
earities  and  the  LTI  subsystem  was  discretized  using 
the  4th  order  Runge-Kutta  algorithm,  introducing  ad¬ 
ditional  states  as  many  as  the  inputs,  to  avoid  inter¬ 
sampling.  Using  this  particular  discretization  at  400 
Hz,  the  continuous-time  performance  was  sufficiently 
reproduced  (see  Figures  13  and  14)  . 

The  two  strip  plots  in  Figure  13  correspond  to  the 
tip  position  before  and  after  the  model-follower  is  in¬ 
cluded  ;  the  base  is  commanded  a  -3  mrad  slew.  The 
top  strip  is  the  tip  position  for  the  inner-loop  controller 
C  =  10  ,  u  =  0  ,  y  =  —  3  mrad  (see  Figure  10)  .  The 
disturbance  ui  is  periodic.  The  lower  strip  corresponds 
to  the  case  where  to  is  identified  up  to  the  first  7  har¬ 
monics  and  the  th  information  is  used  in  the  signal 
generator  to  command  the  bascwhecl  a  -3  mrad  slew 
and  to  cancel  the  effect  of  the  7  harmonics  .  Initially, 
the  identifier  coefficients  are  all  zero  ;  it  takes  approxi¬ 
mately  1  second  for  the  parameters  to  converge.  After 


1  second,  the  base  is  at  -3  mrad  and  the  tip  has  ap¬ 
proximately  ±1  mrad  deviation  (see  lower  strip,  Fig¬ 
ure  13)  . 


Figure  14: 

Since  the  performance  plots  in  Figure  13  do  not  cor¬ 
respond  to  an  LTI  system,  one  cannot  represent  the 
reduction  of  the  harmonic  term  contributions  directly. 
Although  a  transfer  function  description  docs  not  ex¬ 
ist,  we  can  still  compare  the  frequency  contents  of  each 
signal  to  substantiate  the  reduction  of  the  harmonics 
at  the  tip.  Using  the  discrete  Fourier  transforms  of 
the  two  signals  in  Figure  13  ,  we  obtain  Figure  14  . 
The  dashed  and  solid  lines,  respectively  denote  before 
and  after  the  model-follower  is  introduced.  The  dis¬ 
turbance  is  4  Hz  periodic,  and  the  identifier  was  built 
for  the  first  seven  harmonics.  Hence  the  improvements 
are  apparent  up  to  28  Hz  ,  with  reductions  26,  26,  26, 
16,  16,  16  and  10  dB  at  4,  8,  12,  16,  20,  24  and  28  Hz, 
respectively. 
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